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This study was an attempt to delineate some dimensions of teaching 
b style that affect the behavior of emotionally disturbed children in 
Grades 1-2 and 3-5 in regular classrooms. Video tapes were obtained 
for % day of 30 classrooms, each containing 1 or more emotionally dis- 
turbed children. Scores for deviancy and work involvement were ob- 
tained for both disturbed (E) and nondisturbed (Non E) children. 
Some findings are: (a) Pupils’ scores vary between seatwork and reci- 
tation settings in the same classroom; (b) E children manifest less 
school-appropriate behavior than Non E children; (c) teachers who 
are successful in managing the behavior of Non E children are also rela- 
tively successful with E children; (d) techniques of handling misbe- 
havior as such do not correlate with children's behavior; (e) teacher 
*With-it-ness," techniques of handling group movement, and pro- 
graming for variety change in learning activities do correlate with chil- 


dren’s behavior. 


Although authorities recognize that 


emotionally disturbed children are 
present in schools, and that the class- 
room is important in the world of the 
child, we know of no definitive re- 
search on what the classroom teacher 
can do in programing for, or in manag- 
ing the behavior of, emotionally dis- 
turbed children in regular classrooms. 
Much of the literature in this area con- 
cerns the teacher's selecting children 
to refer for treatment by some other 
person, agency, or special class; her 
providing information to these agents; 
her introducing some sort of unit on 
“human relations" or “family living"; 
or her attending a course, seminar, or 
workshop in human relations or men- 
tal hygiene. Other writings recommend 
selecting teachers who have certain 
kinds of personalities, characteristics, 
or traits such as patience, understand- 
ing, flexibility, sensitivity, insight, 
etc. No doubt the advocated character- 


* This investigation was supported by 
Public Health Service Research Grant MH- 
04221 from the National Institute of Mental 
Health. 


istics are desirable, but little has been 
written about how these translate into 
techniques a teacher ean use in manag- 
ing a classroom. 

'The present research is an attempt 
to discover techniques that teachers 
can utilize to manage the overt be- 
havior of emotionally disturbed chil- 
dren in regular classrooms. A further 
concern is whether these techniques are 
significant in the management of emo- 
tionally disturbed children only, or 
whether they are also significant in the 
management of normal children. 

After a year spent observing regular 
classrooms containing emotionally dis- 
turbed children, the authors were 1m- 
pressed by (a) the differences in the 
behavior of these children with differ- 
ent teachers, (b) a tendency to “label” 
both teachers and children, (c) a tend- 
ency to note perceptually outstanding 
aspects of a classroom and to attribute 
the behavior of children to these, and 
(d) the general complexity and variety 
of events occurring in a classroom. The 
authors concluded that this type of re- 
search required a medium that would 
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provide a nonselective, complete, and 
objective record of all observable 
events in a classroom, one that would 
allow many viewings of the same 
events so that a variety of variables— 
for the teacher and subsettings as well 
as for the children—could be studied. 
The method chosen should not only 
provide a means of measuring the va- 
lidity of existing hypotheses and no- 
tions, but should also provide data on 
variables or hypotheses unformulated 
at the time of data gathering. 


DATA-GATHERING PROCEDURE 


The following method was used for 
gathering data: 

1. Two TV cameras were mounted in 
nearly soundproof boxes and placed on 
stands in a classroom. One camera, kept 
stationary, was placed in a position to record 
the largest possible area of the classroom con- 
tinuously and without “editing”; the other 
camera, equipped with a pan-tilt mechanism, 
enabled the engineeer to record various posi- 
tions in the classroom. The decision about 
what to record and where to superimpose 
this picture upon the major picture was 
made by the project director, Priority was 
given to video taping the teacher or the 
emotionally disturbed children when they 
were “lost” from the main picture. 

2. All recording was controlled by engi- 
neers using equipment in a truck parked out- 
side the school building, 

3. Two microphones were used to record 
sound; one was a wireless radio transmitting 
microphone worn by the teacher that made 
possible the recording of everything she said. 

Video tapes of classrooms were obtained 
from two predominantly middle-class sub- 
urbs. Suburb X was involved in a study 
aimed at evaluating special classes, in the 
first two grades, for emotionally disturbed 
children. Referred children were individually 
administered approximately 14 hours of psy- 
chological, educational, psychiatric, and neu- 
rological tests. Some children were assigned 
to special classes (experimental children), 
while “matched” children (control cases) re- 
mained in regular classes. Video tapes were 
obtained for (a) the two special classes con- 
taining 14 experimental children, but not in- 
cluded in this report, and (b) the 15 regular 
classrooms containing the 14 matched chil- 
dren along with 24 children diagnosed as 
emotionally disturbed by means of the same 


diagnostic procedures, but not assigned to 
experimental or control groups at the time 
this study was conducted. 

In Suburb Y, 15 classrooms in Grades 3-5 
containing one or more emotionally dis- 
turbed children (totalling 32) were video 
taped. The children were selected from a list, 
provided by principals, of children who had 
been given individual psychological and psy- 
chiatric tests by the staff of a nearby chil- 
dren’s psychiatric hospital. 

All classrooms were video taped contin- 
uously, for an entire morning or afternoon, 
except when the children left the room, as for 
recess or gymnasium periods. 


Соріха PROCEDURES 


Only behavior in academic activities— 
reading, arithmetic, social studies, science, 
ete.—was coded. Child behavior during non- 
academic activities (attendance, milk time, 
nonacademic games) was not coded. Scores 
were kept separately for four types of aca- 
demic subsettings: (a) subgroup recitation 
periods (Sub Ree), where the teacher worked 
with a subgroup of the class (usually a read- 
ing group) while the remainder of the chil- 
dren worked independently at their scats; 
(b) entire-class recitation (En Rec) periods, 
where the teacher worked with the entire 
class in a recitation period (explaining, dem- 
onstrating, questioning, etc.); (c) seatwork 
nonsphere periods (SW non Sp), where п 
group of children engaged in some kind of 
seatwork without direct supervision by the 
teacher at the time (occurred in conjunction 
with Sub Rec); and (d) seatwork sphere 
(SW Sp), where the entire class engaged in 
seatwork while the teacher worked with 
them (circulating, supervising, occasionally 
commenting, ete.). 

The authors' question about the children's 
behavior was simple: To what degree is their 
behavior school appropriate? The authors 
attempted to measure child behavior which 
would reflect teachers’ success in (a) getting 
the children to engage in the classroom ac- 
tivities (in this case, the academic learning 
activities), and (b) keeping them from mis- 
behaving and disrupting the legitimate be- 
havior of others, 

This paper reports on two of these child- 
behavior codes: work involvement and de- 
viancy rate, 

_ The behavior of from two to eight non- 
disturbed children (Non E), in addition to 
that of each emotionally disturbed child (Е), 
Was coded for each of the four subsettings- 
The Non E children were selected to repre- 
Sent the four quadrants of the seating aT- 
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rangement during the activity. The number 
of Non E children whose behavior was coded 
varied according to the number of children 
in the subsetting (some Sub Rec groups 
contained as few as two children), and the 
number that were clearly visible. All E chil- 
dren were coded throughout the academic 
subsettings in which they were present and 
their behavior was codable. 

The behavior being coded was categorized 
every 10 seconds. The categories were: (a) 
definitely and completely involved in work 
(DI), (b) probably involved in work (PI), 
(c) definitely not involved in work (DO), 
(d) restless, (e) languishing, (f) languishing 
and restless, (g) engaged in task-related de- 
viancy (TR), (h) engaged in non-task-re- 
lated deviancy (NTR) that was mild or 
serious. In the case of the E children, the 
contagion of their deviancy was coded: 
Whenever an E child engaged in a NTR, the 
behavior of the children in this vicinity was 
also coded at the time. 

Scores for each classroom and each sub- 
setting were kept separately for E and Non 
E children. Three scores will be discussed 
here: work-involvement rate, or the mean 
DI/DO ratio; deviancy rate, or the mean 
percentage of 10-second intervals that con- 
tain a tally for NTR—mild NTR being 
scored 1, serious NTR being scored 2 (Fewer 
than 1% of the intervals in the regular 
classrooms contained a serious deviancy— 
overt aggressive acts, open defiance of 
teacher—and these were found in but five 
of the classrooms.); and contagion rate of 


NTR's for E children, or the mean percentage 
of children in the vicinity of the E who 
entered into a deviancy initiated by the E. 

Intercoder agreement between various 
pairs of the four coders ranged from 82%- 
96% agreement with a mean of 92%. 

The teacher-style codes selected for con- 
sideration in this paper will be described in 
the Results section. 

Insofar as possible, the authors attempted 
to adhere to the following principles in de- 
veloping codes for teacher style: (a) The 
categories should designate concrete behav- 
iors, that is, specific managerial technologies, 
rather than "traits" or “characteristics” of 
teachers; (b) dimensions should be those on 
which teachers differed. For example, al- 
though all teachers designate times for chil- 
dren to make a transition from one activity 
to another, not all teachers do it in the same 
manner; (c) the dimensions should be as 
free as possible of commitment to any phil- 
osophical orientation or value system. 


RESULTS 


Table 1 presents a summary of the 
work-involvement and deviancy scores 
for both the E and Non E children in 
the four major activity subsettings. 
(The Ns vary between subsettings be- 
cause not all teachers programed all 
four subsettings. The Ns vary between 
E and Non E groups because E chil- 
dren were not always present in the 


TABLE 1 
MEAN Wonk-INvoLvEMENT AND Deviancy Scores or EMOTIONALLY DISTURBED 
AND NONDISTURBED CHILDREN IN Four CLASSROOM SUBSETTINGS 


Emotionally dis-| Nondisturbed 
turbed children children 
Subsetting t 
Score N Score N 
Ratio 
Definitely in/definitely not involved in work 
Subgroup recitation 76.41 19 | 88.7] 21 | 2.66* 
Class recitation 59.6| 19 | 78.4] 19 | 2.67* 
Seatwork in teacher’s sphere 50.6 | 14 | 67.5] 14 | 2.18* 
Seatwork not in teacher’s sphere 44.8 | 20 | 60.5| 21 | 2.28* 
Non-task-related deviancy score 
Subgroup recitation 2.6| 19 1.9| 21 | 5.49** 
Class recitation 5.9 19 2.8 19 2.95%" 
Seatwork in teacher’s sphere 15.4 14 5.6 14 2.26* 
Seatwork not in teacher’s sphere 13.0} 20 | 10.3) 21 ‚79 


*p < 05. 
** p < O01. 
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subsetting coded or because an ade- 
quately visible sample of Non E chil- 
dren was not always present.) 

In all subsettings, the behavior of the 
Non E children was more school ap- 
propriate than that of the E children. 
This difference apeared in both work- 
involvement scores and  deviancy 
Scores. 

'The behavior of all children varied 
with the classroom setting. The highest 
degree of school-appropriate behavior 
occurred in subgroup recitation periods 
(mostly reading groups), and the low- 
est degree of such behavior occurred 
in seatwork situations. The differences 
between combined recitation periods 
and seatwork periods were significant 
at the .01 level or better for both the 
DI/DO ratio and the NTR score and 
for both E and Non E children. 

It can be concluded that, although 
the school behavior of E and Non E 
children differs in degree of school ap- 
propriateness, both types of children 
change similarly in behavior accord- 
ing to the classroom situation. 

Are special techniques necessary to 
manage the behavior of emotionally 
disturbed children in regular class- 
rooms—techniques that are different 
from those used to manage other chil- 
dren? Or are the necessary managerial 


techniques essentially the same for all 
children in a classroom? A partial 
answer to this question can be ob- 
tained by seeing whether E and Non E 
children respond in the same manner 
to different teachers. 

The correlations between the behav- 
ior of E and Non E children are pre- 
sented in Table 2. Because of subset- 
ting differences, these are presented for 
the four subsettings separately and for 
recitation periods combined and seat- 
work periods combined. Due to differ- 
ences between Grades 1-2 and Grades 
3-5 (i.e., NTR rate tended to be lower 
and DI/DO ratios higher in the upper 
grades), results are given for all grades 
combined and for the lower grades sep- 
arately. 

In view of the r’s shown in Table 2, 
it can be concluded that the degree of 
school-appropriate behavior mani- 
fested by Non E children is positively 
related to the degree of school-appro- 
priate behavior exhibited by E chil- 
dren in the same classroom, Insofar as 
the behavior of the children can be at- 
tributed to the managerial success of 
the teacher, teachers who are success- 
ful in managing a classroom are also 
relatively successful in managing the 
emotionally disturbed children present 
in that classroom. 


TABLE 2 
SPEARMAN r'5 BETWEEN THE BEHAVIOR OF EMOTIONALLY DISTURBED 
AND NONDISTURBED CHILDREN 


N |volvement | N 


All grades Grades 1 & 2 
Subsettin 
Ы ЕРЙ Work-in- Tei Work-in- 
Y| N volvement | N many 


rate 


ratio rate ratio 


Seatwork in teacher’s sphere .381 

Seatwork not in teacher's sphere .641** 
All seatwork ATIS 
Subgroup recitation .558** 
Class recitation .T01** 
All recitation SU 


14 | .398 14 — 3 — 3 
18 | .581** | 18 | .806** | 10 | .488 10 
25 | .611** | 25 | .697* 10 | .600* 10 
18 | .194 |17 | .906** | 10 | .750** | 9 
19|.554* | 7| — 6; — 6 
27 | .638** | 25 | .856** | 13 | .734** | 12 


* p < .05; one-tailed test. 
** p < .01; one-tailed test. 
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Teachers sometimes express concern 
that emotionally disturbed children 
disrupt the behavior of others. Is this 
disruptive influence a consequence of 
the deviant behavior of emotionally 
disturbed children or is it generally 
symptomatic of unsuccessful classroom 
management? 

In an attempt to answer this ques- 
tion, we estimated as follows the con- 
tagion effect of the misbehavior of E 
children: Whenever an E child in- 
itiated deviant behavior, the children 
in his vicinity were coded as “getting 
in” with it or as remaining with school- 
appropriate behavior. The percentage 
of children disrupted by the E child’s 
misbehavior was determined, and an 
average contagion rate per incident 
was computed for each classroom. This 
was compiled for all SW settings com- 
bined and correlated with the deviancy 
rate of Non E children in all SW set- 
tings. (Rec settings were not used here 
because of the low frequency of E child 
deviancies in Rec subsettings.) For all 
grades combined, the correlation was 
829 (N = 25; р < .01). For Grades 1 
and 2 the correlation was .718 (N = 
10; p < .01). 

It may be concluded that the degree 
of contagion of an E child's deviant 
behavior is related to the degree of 
misbehavior occurring in the class- 
room. The teacher who is generally 
successful in managing the surface be- 
havior of children in her classroom not 
only has a lower deviancy rate among 
emotionally disturbed children in that 
classroom, but also produces a class- 
room climate that “contains” the mis- 
behavior of an E child and prevents it 
from disrupting the behavior of other 
children. 

Are there concrete teaching tech- 
niques from which one can predict the 
school-appropriate behavior of chil- 
dren? If so, do these techniques have 
the same effect upon the behavior of 


both emotionally disturbed and non- 
disturbed children? 

The first, or Desist, code dealt with 
what one might call “direct” manage- 
rial techniques: the techniques of deal- 
ing with misbehavior as such. Every 
time the teacher did something to stop 
& child's misbehavior, the incident was 
coded for how the teacher did it and 
the degree of success she had in stop- 
ping the misbehavior. 

The degree of success was coded on a 
7-point scale. This ranged from the 
target child's continuing to misbehave 
on one end, to the child's stopping the 
misbehavior immediately and engag- 
ing in the required work on the other. 
Intercoder agreement averaged 74%; 
agreement within 1 point averaged 
9596. 

The Desist technique was rated on 
(a) clarity, (b) firmness (“Т mean it” 
quality), (c) child treatment (nega- 
tive, neutral, positive), (d) intensity, 
and (e) focus (misbehavior, i.e., “stop 
that talking,” or task, i.e., “do those 
arithmetic problems,” or both misbe- 
havior and task). Intercoder agree- 
ment averaged 90%. 

For only 1 of 30 teachers was there 
a significant correlation between any 
aspect of her Desist technique and the 
degree of immediate success of the 
effort. Thus, for any one teacher, the 
degree of clarity, firmness, etc., of her 
behavior during the event did not re- 
late to the target child’s stopping his 
deviancy or taking up the prescribed 
task. 

Teachers were also ranked according 
to their mean Desist-style scores for 
clarity, firmness, punitiveness, etc. 
These rankings did not correlate with 
the rankings on deviancy or work in- 
volvement for either their E or their 
Non E children. 

It may be concluded that the man- 
ner of dealing with misbehavior as 
such is not a significant determinant 
of how well children behave in a class- 
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room. 'This generalization should be 
limited to the type of misbehavior here 
considered (i.e., fewer than 1% of de- 
viancies were rated as "serious"), the 
kinds of Desist techniques used by the 
teachers (ie., while high anger did 
occur, no extreme punitiveness was ob- 
served, such as whipping or expelling 
during the times classrooms were video 
taped), and the length of time the 
classrooms had been in existence (i.e. 
video tapes were taken in approxi- 
mately the eighth month of school). A 
history of firmness, clarity, or puni- 
tiveness in desisting may have oper- 
ated to reduce the significance of these 
variables. Kounin and Gump (1958) 
found that clarity, firmness, and 
roughness did make a difference in 
conformity-deviancy of children who 
were observers of Desist events during 
the first 4 days of kindergarten; they 
also found more “no observable ef- 
fects” during the second through 
fourth day than on the first day. 

Desist events were coded for two 
other categories of teacher style which 
could be judged from such events, 
namely, ‘Overlappingness” and 
“With-it-ness.” 

Overlappingness refers to the degree 
to which a teacher overtly manifests 
signs of attention to two aspects of 
a classroom simultaneously. When two 
issues are confronted, does she attend 
to both or drop one completely and go 
“all out” to the other? For example, 
she may be working with a reading 
group when a deviancy occurs in the 
seatwork group. She may issue a Desist 
order to the child in the seatwork 
group while simultaneously attending 
to the reading group (remaining in the 
same seat, instructing the reader to 
continue reading, scanning the reader 
while desisting the deviant). Or, she 
may immerse herself in the deviant 
and completely drop the reading group 
for the time (changing manner and 
geographically moving away without 


any instructions to, or scanning of, the 
reading group). 

With-it-ness refers to the degree to 
which the teacher communicates to 
the class the impression that she does 
or does not know what is going on 
(has the proverbial “eyes in back of 
her head"). In desisting, the teacher 
communicates that she is not “with it” 
by (a) desisting a minor deviancy 
while a major one is not acted upon 
(desisting two children for whispering 
while two others are throwing paper 
airplanes or chasing each other), or 
(b) desisting too late (after a deviancy 
has increased in seriousness or has be- 
gun to spread to other children). 

Teachers were scored for Overlap- 
pingness on the basis of the percentage 
of their desists in which the teacher 
went all out to handle a deviancy and 
completely dropped the ongoing ac- 
tivity; they were scored for With-it- 
ness on the basis of the percentage of 
their desists that were aimed at the 
wrong target or came too late. 

All correlations between teacher 
style and child behavior to be reported 
are between hypothesized good aspects 
of teaching style and desirable child 
behavior. For example, in setting up 
ranks for Spearman 7’s the classroom 
with the lowest deviancy rate would 
be ranked 1; the teacher with the 
highest With-it-ness would be ranked 
1. A positive correlation between a 
teacher dimension and pupil deviancy 
or work involvement means that the 
teachers with more of the dimension 
had less deviancy and more work in- 
volvement. 

Overlappingness, contrary to our ex- 
pectations, did not correlate signifi- 
cantly with either deviancy or work 
involvement for either the E or Non E 
children. However, Overlappingness 
did correlate with With-it-ness (r = 
498; p < .01, N = 30). With-it-ness, 
as can be seen in Tables 3 and 4, did 
correlate in many subsettings with 
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TABLE 3 
SPEARMAN r’S BETWEEN THE WITH-IT-NESS OF TEACHERS AND THE 
Deviancy RATE OF CHILDREN 
All grades Grades 1 & 2 
Subsetting — Emotion- T Smotionall 
Nested y | aly е | буде [ov [rinsed | N 
children children children 
Seatwork in teacher's sphere .439 14 | .547* | 14 — 3 — 3 
Seatwork not in teacher's 
sphere .386* | 20 | .517* | 19 559* | 11 .709* | 10 
AII seatwork .480** | 27 | .524** | 25 .568* | 11 .685* | 10 
Subgroup recitation .361 21 | .432* | 18 .321 |11 .191 |10 
Class recitation —.108 20 | .327 18 | —.136 6 .179 7 
All recitation .198 27| .413* |26 | —.021 12 | —.066 | 12 


* p < .05; one-tailed test. 
** р « .01; one-tailed test. 


both devianey and work involvement 
—the higher a teacher's degree of 
With-it-ness, the higher the work in- 
volvement of the children and the 
lower their deviancy rate. It should 
also be noted that correlations for E 
children were always in the same di- 
rection as for Non E children. In fact, 
the significant correlations obtained 
were either significant for the E chil- 
dren and not for the Non E children, 
or tended to be higher for the E than 
Non E children. 

Since With-it-ness and Overlapping- 
ness correlate with each other, why 


does only With-it-ness correlate with 
children's behavior? Our current inter- 
pretation is that, Overlappingness en- 
ables a teacher to receive information 
about what is going on, but that the 
children do not perceive or categorize 
this teacher dimension, or even notice 
it. What they do see and react to, how- 
ever, is what the teacher does and 
communicates—she stops deviancy in 
time or she makes mistakes in picking 
deviancy targets when she is perceived 
by the children as knowing what is 
going on regarding their behavior, she 
has more surface managerial success, 


TABLE 4 
SPEARMAN r'5 BETWEEN THE WiTH-IT-NESS OF TEACHERS AND THE WORK-INVOLVEMENT 
RATE or CHILDREN 
All grades Grades 1 & 2 
Subsetting Nondis- Emotion- Jondi Emotion. 
Abel N ally dis- N Nondis- ally di S 
1 ла turbed | N IN 
children ade children E 
Seatwork in teacher's sphere .230 | 14 | .766** | 14 = 3 =< 
Seatwork not in teacher’s sphere .199 | 20 | .293 19 .396 | 11 697* Es 
Allsentwork —— .227 | 27 | .501** | 25| .505 | 11| .797* | 10 
Subgroup recitation .020 | 21 | .534* | 18}  .182 11} .233 | 49 
Class recitation —.205 18|.335 | 18} —.657| 6| 607 | 7 
All recitation — .042 | 27 | .398* 25 .073 | 12 | .448 12 


* p < .05; one-tailed test. 
** у < .01; one-tailed test. 
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that is, induces worklike behavior and 
restrains deviancy. Receiving or hav- 
ing knowledge is not sufficient unless 
the fact that this occurs is communi- 
cated to children. 

One function of a teacher is to in- 
itiate activities and to maintain move- 
ment within them. Transition inci- 
dents should provide points where one 
could sample the manner in which a 
teacher manages the activity flow of 
a classroom. Transition points are 
those times when the children are in- 
structed to stop an old activity (“Put 
spelling papers away.") and start a 
new one (“Get your arithmetie books 
out.”). Old and new activities were 
distinct periods that teachers and pu- 
pils would presumably recognize and 
label in the same manner. They gen- 
erally correspond to common content 
categories: arithmetic, spelling, read- 
ing group, etc. 

All transition points were coded for 
the teacher's treatment of the activity 
to be stopped and of the activity to be 
Started. The code included such items 
as the behavior mode of the children 
at the time of the transition; the pres- 
ence of simple delineation or delinea- 
tion with Clarity, Group Alerting, 
Challenge-Arousal, Zest; the attention 
property of the signal; the presence 
of antiresolution properties; the ap- 
propriateness of timing; and the chil- 
dren’s immediate reaction to the in- 
duction. 

Of particular relevance at this time 
are the following teacher behaviors, 
designated as antiresolution proper- 
ties, that oppose her overt intent to 
have the children stop the old activity 
and begin the new one: 

1. Providing contrary props, or pas- 
sing out props that were judged to lure 
the children away from the teacher- 
designated official new activity—for 
example, passing out spelling tests the 
children had previously taken while 


telling the children to start their arith- 
metic. 

2. Returning to old activity, or des- 
ignating a new activity and then in- 
itiating an issue pertaining to the 
old—for example, asking how many 
children had all their spelling words 
right after having told them to stop 
spelling and start arithmetic. 

3. Dangling, or signaling the pupils 
to start a new activity, and then drop- 
ping it in midair, so to speak, leaving 
the children with nothing to do. 

4. Thrusting, or injecting a transi- 
tion order at a time or in a manner in- 
dicating the teacher’s unawareness of 
the children’s receptivity to this mes- 
sage. For example, a teacher had been 
working with a reading group while 
the Robins were doing arithmetic seat- 
work. Suddenly, and in a manner in- 
dicative that her own intent was the 
only determinant of the timing—she 
had not looked up to see what the 
Robins were doing nor given verbal 
warning—the teacher thrust the or- 
der: “Robins, take out your reading 
books." A thrust is similar to a person 
butting in on the conversation of two 
other persons without either waiting 
for them to notice his presence or eas- 
ing in by listening to what they are 
talking about. 

5. Giving conflicting, confusing di- 
rections, or giving too many orders or 
contradietory things for the children 
to get with. 

Teachers were scored and ranked 
on the basis of the percentage of their 
transitions that contained at least one 
of the above antiresolution properties. 

The correlations between teachers' 
antiresolution scores during transition 
periods and children's behavior are 
presented in Tables 5 and 6. A positive 
correlation means that teachers with 
the lowest percentage of antiresolution 
techniques have the least deviancy OT 
most work involvement in their class- 
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TABLE 5 


SPEARMAN r’S BETWEEN TEACHERS’ TRANSITION-RESOLUTION SCORES AND THE 
Deviancy RATE OF CHILDREN 


All grades Grades1 & 2 

Subsettii СИРР ion- CNET Emotion- 
pe Nopi- | SEE CAPERE ME aly dis- N 

children ирей children Een 
Seatwork in teacher's sphere .936 | 14 | .227 | 14 — 3 — 3 
Seatwork not in teacher's sphere „Зат |20| .529* | 19 | .425 11 | .627* | 10 
All seatwork .308 27| .310 25 | .425 11 | .670* | 10 
Subgroup recitation .419* | 21 | .530* | 19 | .527* | 11 | .282 10 
Class recitation .155 20 | .336 18 | .071 6 357 7 
All recitation .350* | 27 | .386* | 26 | .397 12 | .302 12 


* р < .05; one-tailed test. 


rooms. There are a sufficient number of 
significant correlations to justify the 
conclusion that one meaningful param- 
eter of classroom management pertains 
to techniques of managing transitions. 
This dimension of classroom manage- 
ment, moreover, applies to both E and 
non E children in the same direction. 
Having established a relationship 
between the techniques a teacher uti- 
lizes in directing children’s movements 
from one activity to another and the 
degree to which she succeeds in in- 
ducing school-appropriate behaviors in 
these activities, one may well ask 


about the nature of the activities in 
which movement is programed. 

One question about the programed 
activities in the school day has to do 
with the length of activities. A concept 
in common usage among teachers is 
that of “attention span"—the length 
of time that a child can remain in- 
volved in an activity before becoming 
“bored,” restless, and desirous of quit- 
ting. 

Having delineated transition points 
between one activity and another in all 
teachers’ programs, the authors could 
measure the average length of activ- 
ities by dividing the length of total 


TABLE 6 


SPEARMAN т?з BETWEEN TEACHERS’ TRANSITION-RESOLUTION SCORES 
AND THE Wonx-INvOLVEMENT КАТЕ OF CHILDREN 


All grades Grades 1 & 2 
Subsetting К Pinotion- à 
Nondis- is- Nondis- Emotion- 
turbed | N | бәү | ar | turbed ally dis- 
сша children children due | N 
Seatwork in teacher’s sphere .486* | 14 | .222 14 _ z 
Seatwork not in teacher’s sphere .872 | 20 | .474* |19| .380 | 11 Бб 3 
Allseatwork ——- -426* |27 | .642** | 25 | “49g |11] `600* | 10 
Subgroup recitation .495* | 21 | .592** | 19 | .536* | 11 d 10 
Class recitation .955 | 18 | .362 18| 1057 ы m 10 
All recitation -394* | 27 | `618** | 25 | .547* |12| C21 | 1 


* p < .05; one-tailed test. 
** p < 01; one-tailed test. 
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time observed by the number of ac- 
tivities. Teachers were then ranked on 
the basis of the average length of time 
that they required the children to re- 
main in an academic activity. 

In no case did the authors find a sig- 
nificant correlation between average 
duration of academie activities and 
deviancy rate or work-involvement 
rate of children. Correlations were 
computed for activity length for both 
E and Non E children separately in 
the upper and lower grades and for all 
grades combined. 

It must be concluded that attention 
span, when this concept is made equiv- 
alent to time as such, is not a signifi- 
cant aspect of activity programing 
in elassrooms for either emotionally 
disturbed or nonemotionally disturbed 
children. 

The authors then attempted to look 
at programing from the viewpoint of 
satiation theory. Satiation is defined 
as the process of decrease in liking for 
an activity due to repetition. In terms 
of the child-behavior variables meas- 
ured in this study, it would be expected 
that increasing satiation would result 
in decreased work involvement and in- 
creased deviancy. 

Experimental studies of satiation 
have demonstrated that felt progress 
and variety decrease the rate of satia- 
tion. The video tapes, being restricted 
to data about overt behavior, could 
not provide measures of how much 
progress a child felt. Conceivably, 
however, it could be possible to obtain 
measures of the amount of change and 
variety programed into the classroom 
day (or, in this case, half day). 

'The problem was to determine how 
different one activity is from another. 
Doing arithmetic is some change from 
doing spelling words. However, there 
are also similarities: children are sit- 
ting, writing, at the same seats, ete. It 
is also obvious that more change is in- 
volved in a transition from “seatwork 


spelling" to “recess” than in a transi- 
tion from “seatwork spelling" to "seat- 
work arithmetic." 

Accordingly, a nine-category coding 
System was designed to measure the 
amount of change between one activity 
and another: Content—19 categories 
such as reading, arithmetic, drawing, 
geography; Content Category—aca- 
demie, semiacademic, and nonaca- 
demic; Teacher Presentation Pattern 
—eight categories such as lectures, 
questions; Props—írom routine to 
unique; Group Configuration; Child 
Responsibility—such as listens pas- 
sively, initiates own pace; Leadership 
—such as teacher, alone, another child; 
Overt Behavior Mode—four degrees 
from sedentary to physically vigorous 
activity; and Location—five subcate- 
gories, such as own seat, outside of 
room. 

Each activity was coded for each of 
the nine major categories; change was 
scored at transition points. A change 
in any category was scored 1, and the 
number of categories scored as 
changed were summed to obtain a 
transition variety-change score. At 
any one transition a classroom could 
obtain a score of from 1-9. (A content 
change or teacher-sphere change was 
minimally required to designate a 
transition.) The sum of these scores 
was determined for each teacher. This 
Score represents a combination of va- 
riety and contrast: The score is in- 
creased by increasing the variety of 
activities entered and by increasing 
the amount of difference between tem- 
porally adjacent activities. 

No significant positive correlations 
were found between the amount of 
programed variety change in a teach- 
er's half day and the degree of school- 
appropriate behavior of children. Of 
64 correlations—no correlations were 
computed for “cells” containing fewer 
than 8 cases after being broken down 
by grade, setting, and E or Non E— 
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26 were negative, and 3 of these were 
significant at the .05 level—a finding 
that could be accounted for by chance. 

One may conclude that the sheer 
amount of variety and change that a 
teacher programs in a classroom is not 
à dimension significant to managerial 
success. 

This absence of a positive correla- 
tion was puzzling. Perhaps the “coffee- 
break” theory of handling problems of 
work satiation was pertinent. (Coffee 
breaks make the total work situation 
more tolerable, but do not actually 
change the attraction of the work as 
Such; thus, the attractiveness of 
“tightening the bolt over and over” is 
not changed by the nonrelated cofiee.) 

Accordingly, a variety-change score 
was developed, based only on school- 
unique activities, namely, those ac- 
tivities that most children would per- 
ceive as concerned with learning or 
skill development, excluding the va- 
riety-change scores of entertainment, 
rest, physical exercise, recess, and non- 
academic game activities. 

The correlations between overall 
variety change for learning-related 
activities and children’s behavior were 
higher than those obtained between 
children’s behavior and variety change 
for all activities. (Of 64 pairs of cor- 
relations, 59 were higher.) And, except 
for a few negative correlations for the 
upper grades only, all were positive. 
However, significant positive correla- 
tions were obtained only in recitation 
subsettings, and only for all grades 
combined or for Grades 1 and 2 only. 
Taken separately, none of the correla- 
tions for the upper grades were statisti- 
cally significant. For subgroup recita- 
tion settings, the following correlations 
were obtained in Grades 1 and 2 with 
deviancy rates: .72 for all children 
(p < .01, N = 11); .65 for Non E chil- 
dren (p « .05, N = 11); .59 for E chil- 
dren (p « .05, N — 10). With work- 
involvement rate, the correlations were 


5 for all children (p < .01, N = 11); 
5 for Non E children (p < .01, N = 
11); and .67 for E children (p « .05, 
N = 9). 

The above correlations support the 
theory that programing for variety 
change is a significant dimension of 
classroom management. It should be 
noted, however, that variety change 
within an activity was not scored. 
Would differentiating within-activity 
variety change provide a more refined 
analysis of teaching style? 

To obtain refined variety-change 
scores for seatwork nonsphere settings, 
three refinements were added to the 
previous coding and scoring. One deals 
with the type of intellectual challenge 
required in the seatwork assignment; 
four categories were designated: (a) 
requires attention and perseveration 
only, such as copying words; (b) re- 
quires perseveration plus skill usage, 
such as doing addition of arithmetic 
numbers; (c) requires problem solu- 
tion, such as solving word problems in 
arithmetic; and (d) requires creative 
effort, such as making up a story. The 
second refinement in the previous scor- 
ing deals with the designation of post- 
termination activities—‘“fillers.” Does 
the teacher provide activities for the 
children when their assigned seatwork 
is finished, and, if so, how different are 
these activities from the rest of the 
seatwork? The authors reasoned that 
making available permissible school- 
type activities for children to engage 
in after required work is completed 
constitutes an antisatiation technique. 
A change score was thus computed be- 
tween the last official seatwork task 
and the official filler, and added to the 
previous change score for seatwork 
activities. 

Third, the variety-change score was 
refined by dividing the time spent in 
seatwork by the total variety-change 
score. This provided an index of seat- 
work variety change. The lower the 
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index, the greater the experienced va- 
riety change in a period of time. 

The results for this analysis are 
presented in Table 7. It can be seen 
that programed variety change within 
a subsetting is significantly related to 
the school-appropriate behavior of 
children in that subsetüng, and that 
the direction of the relationship is the 
same for both E and Non E children. 
(SW Sp subsettings were not analyzed, 
as only three classrooms in Grades 1 
and 2 had these subsettings. The de- 
velopment of a code for variety change 
in recitation subsettings has not yet 
been completed.) 

Unlike the other teacher-style di- 
mensions where the direction of the 
relationship with pupil behavior was 
the same for the lower and upper 
grades, the correlations of variety 
change are in opposite directions for 
the lower and upper grades. 

Programing to reduce satiation is a 
significant dimension of teacher style, 
though this must be planned differ- 
ently for younger than for older 
groups. It should be noted that, in the 
sample studied, the average variety- 
change index for both grade levels was 
15 minutes. Since the duration of ac- 
tivities as such has not been found to 
be a significant variable, this differ- 


ence between the lower and upper 
grades cannot be explained by the 
single statement that younger children 
have “shorter attention spans.” One 
possibility is that “felt progress in 
learning” is a more important anti- 
satiation factor for the older children 
than for the younger children. Experi- 
enced variety may be relatively more 
salient for the younger children. The 
older children, then, might require ac- 
tivities of sufficient length to enable 
them to experience progress in learning 
and competence. 

Programed variety change is also 
relevant to the problem of the disrup- 
tive effects of the deviant behavior of 
disturbed children. The correlations 
between variety change and contagion 
rates are in the same direction as those 
for deviancy and work involvement. 
For Grades 1-2 these are .658 (p < 
.05, N = 10) and for Grades 3-5, 
—.357 (N = 7). (There were only 
seven classes among the upper grades 
in which E children initiated a devi- 
ancy during SW.) 

This study does indicate that dimen- 
sions of concrete teacher techniques 
can be delineated that make a differ- 
ence in how children behave in class- 
rooms. What is more, these are not 
unique for the emotionally disturbed 


TABLE 7 


SPEARMAN r's BETWEEN SEATWORK VARIETY CHANGE AND 
CHILDREN’S BEHAVIOR IN SEATWORK 


Grades 3-5 


Grades 1-2 


Deviancy Work involvement 


Deviancy Work involvement 


Emotionally | Nondisturbed Emotionally 


x Н Nondisturbed | Emotionally | Nondis. | Emotionally isturbed 
disturbed disturbed Ңң disturbed isturbi Nondisturbe 

children children children children children AM. Red children 
Rate |N | Rate |N | Ratio | N | Ratio | N| Rate |N | Rate | | Ratio | N | Ratio | N 
—.413 | 9 | —.667* | 9 | —.250 | 9 | —.667* | 9 | .600* |10|.s73** | 11 | .539* | 10| .807* |11 


* p < .05; one-tailed test. 
** p < 01; one-tailed test. 
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children. In all cases, the direction of 
effect of a particular parameter of 
teaching style was the same for dis- 
turbed as for nondisturbed children. 
One might also consider the impli- 
cations of the findings of this study in 
relation to the training of teachers. 
For one thing, these findings point to 
the necessity of discovering dimensions 
of teaching style that are relevant 
to the ecology of the classroom and 
to a teacher’s position in this setting. 
They justify a degree of skepticism 
about extrapolating dimensions of 
adult-child relations from other set- 
tings (homes, psychotherapy clinics) 
and applying these directly to teach- 
er-child relations. They also raise 


9 


questions about the fruitfulness of ana- 
lyzing teachers on the basis of “person- 
ality” characteristics as compared to 
concrete techniques of programing ac- 
tivities and initiating and maintaining 
movement in the program. And, with- 
out the intent of minimizing the im- 
portance of studying individual chil- 
dren, the findings do suggest placing 
a higher priority on training for group 
management than is currently em- 
phasized in educational psychology 
curricula. 
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The purpose of this study was to investigate the influence of achieve- 
ment motivation and test anxiety on performance in programed in- 
struction. À quasi-projective measure of achievement imagery, a test- 
anxiety questionnaire, and an achievement, pretest were administered 
to 139 undergraduate pupils and related to their performance on pro- 
gramed instruction. Performance was analyzed using 3 criteria: (a) 
time needed to complete the material, (b) number of incorrect re- 
sponses, and (c) a short-term retention test. The high-achievement- 
motivated students performed better on all 3 criteria and hence learned 
more efficiently with programed instruction than low-achievement- 
motivated students. High-test-anxiety students also worked faster and 
made fewer errors than low-anxiety students, but failed to exhibit 


higher retention scores. 


Considerable attention has been 
given in recent years to the study of 
achievement motivation and its con- 
comitant effect upon classroom learn- 
ing. MeClelland (1951) and his fol- 
lowers have spurred much of this 
interest by providing ways to objec- 
tively measure or identify achievement 
motivation. Yet the research up to this 
point has not produced any clear-cut 
agreement with regard to the influence 
of achievement motivation within the 
classroom. 

When grade-point averages (GPAs) 
are employed, as they often are, to 
represent scholastie achievement, the 
disparity becomes even clearer. Stu- 
dies, for example, by McClelland, 
Atkinson, Clark, and Lowell (1953) 
and by Weiss, Wertheimer, and Groes- 
beck (1959) have reported moderate 
correlations of .51 and .34, respec- 
tively, between achievement motiva- 
tion and GPAs of subjects (Ss), using 
college students as Ss. Other studies 
by Lowell (1952) and Parrish and 
Rethlingshafer (1954), for example, 
have been unable to show any signifi- 
cant relationship between grades and 
achievement motivation. Admitting 
that GPAs represent a rather general 
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estimate of academic achievement, it 
could easily be argued that some other 
means of assessing achievement is 
needed. 

Frequently some type of paper-and- 
pencil task has been employed as 
a measure of achievement—ranging 
from simple code substitution to highly 
complex arithmetie problems. 'The re- 
sults generally have shown that Ss 
with high achievement motivation tend 
to exhibit higher performance scores 
than do low-achievement-motivated 
Ss, especially on the simpler tasks 
(French, 1955). And although paper- 
and-pencil tasks may afford a more 
reliable index of achievement, they do 
not usually represent actual class ma- 
terial. If the influence of achievement 
motivation is to be fully understood, 
particularly at the college level, it 
would seem highly desirable to have 
the tasks represent actual course ma- 
terial. 

One technique which seems valuable 
in this regard is programed instruction. 
Not only is a student’s rate of per- 
formance easily recorded, but also the 
difficulty of all items (tasks) is kept 
intentionally low. High-achievement- 
motivated students thus have an op- 
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portunity to work faster and to show 
higher overall proficieney than do Ss 
with low achievement motivation. One 
of the two major purposes of this 
study, then, is to verify the hypothesis 
that high-achievement-motivated Ss 
will exhibit higher performance on pro- 
gramed instruction than will Ss with 
low achievement motivation. 

A second major purpose of this 
study is to investigate the relationship 
of a second, equally important motiva- 
tion variable—test anxiety—to per- 
formance on programed instruction. 
Although the research interest in test 
anxiety seems to be increasing, its in- 
fluence upon performance within the 
classroom, particularly on programed 
instruction, has not been sufficiently 
studied. 

The results have generally shown 
that low-anxious Ss perform better 
than high-anxious Ss when the task 
involves either complex tasks or stress- 
motivating instructions (Lucas, 1952; 
Nicholson, 1958; Sarason, Mandler, & 
Craighill, 1952). Considerably less 
attention has been devoted to ascertain 
situations whereby high-anxiety Ss 
Would actually excell in performance. 
One of the few studies to approach this 
problem was by Grimes and Allinsmith 
(1961) in which it was found that 
highly anxious children worked best 
in highly structured situations. Since 
programed instruction does represent 
a fairly well-structured, nonstressful 
situation as a rule, it would seem plau- 
sible to expect the high-test-anxious Ss 
to work faster and show greater pro- 
ficiency than low-test-anxious Ss on 
this type of task. This, then, represents 
the second major hypothesis to be 
verified by this investigation. 


METHOD 


The Ss for this study included 139 under- 
graduate students in educational psychology 
at Indiana University. Before beginning the 
programed-instruction phase of the course, 


all Ss were administered the following three 
instruments: 

1. Iowa Picture Interpretation Test (Form 
RK). This test, developed by Hedlund 
(1953), consists of four alternative responses 
to each of 24 Thematic Apperception Test 
(TAT) pictures. S is instructed to rank order 
each set of four alternatives on the basis of 
how well he describes what is happening in 
the picture. S's achievement motivation, la- 
beled his “achievement imagery” score, is 
derived by summing the ranks assigned by 
him to each alternative previously desig- 
nated as an А: choice. 

2. Test Anziety Questionnaire (TAQ). 
Mandler and Sarason (1952) developed this 
instrument to assess a person’s feelings and 
attitudes about a variety of testing situa- 
tions, for example, achievement, aptitude, 
ete. The strength of a person’s subjective 
response is marked along a 0-15 point con- 
tinuum and a total score obtained by sum- 
ming for all 35 items. 

3. Achievement pretest. This test, con- 
sisting of objective-type questions, was de- 
signed to serve as a preliminary measure of 
S's skill and knowledge of basic principles 
related to test construction and evaluation. 
The questions closely paralleled but did not 
duplicate statements appearing on the pro- 
gramed-instruction material. 

During the 3 weeks which followed, ap- 
proximately 2 hours each week, Ss were given 
a series of three linear-type programed book- 
lets, dealing with the construction and anal- 
ysis of classroom achievement tests.’ Special 
care was taken at the outset to explain the 
main purposes and advantages underlying 
this type of instruction and to stress the 
relevance of the material being presented. 
Each booklet contained about 50 supply-type 
frames to which S responded on a specially 
prepared answer sheet. A record was kept of 
the number of correct responses for each S 
as well as the number of minutes spent. 
Mimeographed reading material, unrelated 
to the programed material, was handed to 
each S as he completed each unit so that all 
Ss began a unit at the same time and spent 
the same number of class periods. The ex- 
perimenter supervised all aspects of the pro- 
gramed instruction. 


Dersicn 


On both the achievement-motivation and 
test-anxiety scales, Ss were categorized as 


*The authors are indebted to Jol 
с à hn Krum- 
boltz of Stanford University for рне 
most of the programed instruction material. 
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being either High (above the median) or 
Low. Ss were also classified as High (above 
the median) or Low on the basis of their 
pretests so that within each level of achieve- 
ment motivation and test anxiety Ss could 
be matched according to their pretest ability. 
This produced a 2 X 2 X 2 factorial design 
and permitted the application of a three-way 
analysis of variance on each of several per- 
formance measures. 

To adequately represent Ss’ performances 
on the programed instruction, three separate 
measures were obtained. These included: (a) 
length of time each S took to complete the 
material, (b) number of frames answered 
incorrectly, and (c) S's proficiency on an 
achievement test”, administered at the con- 
clusion of the programed instruction. 


RESULTS 


Table 1 presents a comparison of 
means on the three performance meas- 
ures for Ss classified as High or Low 
in achievement motivation and High 
or Low in test anxiety. 

In order to statistically test the two 
hypotheses—namely, that a higher 
degree of motivation, either achieve- 
ment motivation or test anxiety, would 
produce better performance on the 
programed instruction—an analysis of 
variance was performed on each of the 
three performance measures.3 

On the first performance measure— 
minutes taken to complete the pro- 
gramed material—all three main ef- 
fects were significant and in the pre- 
dicted direction. Ss who were high in 
achievement motivation, test anxiety, 
or who scored high on the pretest 
tended to work through the program 
at a faster rate (F's = 11.20, 5.08, 7.10, 
respectively; df — 1/131). It should be 


?'The achievement test was constructed as 
a parallel form of the pretest. Both tests 
were found by a previous analysis to have 
internal consistency reliabilities above .75 
and to intercorrelate .81 with each other. 

3 These and succeeding calculations were 
performed on the IBM 709 data processing 
system at the Indiana University Computing 
Center. Grateful acknowledgement is made 
to Dale Hall who made computer time avail- 
able for this study. 


TABLE 1 


MEAN Scores ON THREE MEASURES OF 
PERFORMANCE IN PROGRAMED 


INSTRUCTION 
Groups Time | Errors Reten € 
Low AM-Low TA 125.2 | 26.5 | :20.1 
Low AM-High TA 124.6 | 19.9 | 20.6 
High AM-Low TA 107.6 | 20.2 | 24.1 
High AM-High TA | 100.0 | 13.8 | 27.0 


Note.—AM: Achievement Motivation. 
TA: Test Anxiety. 


noted, however, that a significant in- 
teraction occurred between achieve- 
ment motivation and test anxiety (F 
= 4.16; df = 1/131). 

On the second performance meas- 
ure—number of incorrect responses on 
the programed material—two main 
effects were significant. Ss high in 
achievement motivation or test anx- 
iety tended to commit fewer errors 
(Fs = 9.25, 5.18, respectively; df = 
1/131). There was also a significant 
interaction effect between test anxiety 
and pretest ability (F — 4.08; df — 
1/131). 

As a third measure of Ss’ perform- 
ance on the programed instruction, 
scores on the short-term retention test 
were analyzed. Two main effects of 
achievement motivation and pretest 
ability were significant. Thus, Ss high 
on either of these two variables scored 
reliably higher on the retention test 
than did Ss low on the same variable 
(Fs — 8.00, 37.50, respectively; df — 
1/130). 


CONCLUSIONS 


The results of the present investiga- 
tion substantially confirm both hypo- 
theses. Specifically, it has been dem- 
onstrated that groups with high 
achievement motivation or high test 
anxiety (a) required less time to com- 
plete the programed learning material; 
(b) made fewer overt errors on the 
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material, and (c) received higher 
Scores on a short-term retention test 
than did low-achievement-motivated 
or low-test-anxiety groups. These 
findings are in accord with some of the 
reported research, though it is hazard- 
ous at this point to make any direct 
comparisons. The occurrence of several 
significant interactions adds further 
scepticism and doubt regarding the 
overall effects of any one motive. 

One important conclusion which has 
stemmed from the analysis of both 
Time and Errors scores is that on well- 
structured but relatively easy mate- 
rial such as programed instruction, the 
highly motivated Ss work much more 
efficiently than the less-motivated 
Ss. In summary, the high-achieve- 
ment-motivated and high-test-anxiety 
groups work faster and make fewer er- 
rors than those who are low on either 
variable. One possible explanation for 
this is that the highly motivated Ss are 
able to derive greater benefit from the 
immediate knowledge of results, char- 
acteristic of programed instruction. 
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CULTURAL INFLUENCES ON RETENTION OF LOGICAL 


AND SYMBOLIC MATERIAL’ 
HERBERT HARARI AND JOHN W. McDAVID 


University of Miami 


American educational and training practices are characterized pre- 
dominantly by objective testing based on recognition whereas those of 
Israel are characterized by subjective testing based on reconstruction. 
To explore effects of these cultural differences upon retention, a logical 
content task and a content-free task comprised of symbols were ad- 
ministered to 320 12-year-old Ss, equally divided according to culture 
(American, Israeli), sex, recall interval (immediate, 48-hour delayed), 
and retention measure (recognition, reconstruction). Significantly 
higher reconstruction scores for the Israeli group partially support the 
hypothesis that cultural factors may give rise to differences in perform- 
ance on retention tasks. The results suggest the relevance of experien- 


tial factors to individual differences in performance on various types of 


learning measures. 


Early anthropological studies, in- 
trospective and anecdotal rather than 
experimental in nature, suggested that 
memory is primarily a function of the 
intrinsic interests of the individual 
(e. g., Bartlett, 1932; Nadel, 1937). 
These interests were presumed to be 
related to the individual's past mean- 
ingful experiences within his social 
group. Only one really experimental 
cross-cultural study dealing with 
memory has been reported: a com- 
parison of immediate-recall ability of 
Caucasian, Japanese, and Chinese 
children in Hawaii (Louttit, 1931). 
However, this study suffers some in- 
adequacy since all subjects (Ss) be- 
longed to the same educational system 
as well as to fairly homogenous sub- 
cultures. 

In America, experimental studies of 
memory dealing with recall and rec- 
ognition have rather consistently 
shown scores on recognition measures 
to be superior to scores on recall meas- 


1The data on which this report is based 
were collected in connection with the first- 
named author's master's thesis in psychol- 
ogy at the University of Miami (Harari, 
1962). The constructive suggestions of Rob- 
ert B. Tallarico as well as the assistance of 
Pinhas Sommer in collecting data in Tel 
Aviv are gratefully acknowledged. 
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ures (e. g, Anastasi, 1932; Burt & 
Dobell, 1955; Luh, 1922; Phillip & 
Peixotto, 1949; Stalnaker, 1935). Ex- 
perimental cross-cultural studies have 
focused their attention on various as- 
pects of individual and group per- 
formance on tests of intelligence rather 
than of memory (Buros, 1959). Thus, 
there is support for Klineberg's (1954) 
contention that “in absence of re- 
search on memory in a variety of cul- 
tures, it is not possible to state whether 
the ‘laws’ of learning and forgetting 
would be equally valid under all so- 
cial conditions [p. 226].” 

'The study reported here compared 
the performance of American and Is- 
raeli children on various memory 
tasks using both recognition and re- 
construction as measures of retention. 
The experience of the senior author in 
dealing with high school students in 
both American and Israeli educational 
institutions led to the assumption that 
the academie training of American 
children relies heavily on recognition 
methods in assessing learning while 
that of Israeli children relies heavily 
on reconstruction methods. Thus it 
was expected that these cultural vari- 
ables which differentiate the educa- 
tional background of the two groups 


CULTURAL INFLUENCES ON RETENTION 19 


should facilitate performance on rec- 
ognition measures for American stu- 
dents, and conversely, should facili- 
tate performance on reconstruction 
measures for Israeli students. 


METHOD 


Subjects 


Ss were 320 12-year-old children, 160 boys 
and 160 girls. Of these, half were enrolled in 
middle-class suburban public schools in 
southern Florida (the American sample), 
and half were enrolled in a comparable 
school in Tel Aviv (the Israeli sample). All 
Ss were Jewish. Ss were drawn at random 
from the total populations of each school 
and were assigned randomly to each of the 
experimental conditions described below. 


Learning Tasks 


A meaningful content task, the Marble 
Statue test, was adopted from Whipple's 
(1915) manual after being judged as culture 
fair by three American and three Israeli 
educators. A nonmeaningful symbol task 
was specially constructed after considerable 
pretesting. It consisted of five sets of two- 
digit numbers with a simple geometrical 
design opposite each set. Black digits and 
symbols approximately 3” X 3” were pre- 
sented on a white 22” X 24” card. Both the 
symbol task and the content task were used 
to secure immediate as well as delayed 
measures of retention. 


Retention Measures 


Although the same learning tasks were 
presented to all Ss, two methods of assessing 
retention were utilized. Half the Ss in each 
cultural sample (40 Ss of each sex) were 
told to reproduce the Marble Statue story 
in writing and to reproduce the symbols 
with their matching digits in writing. The 
remaining half were administered a one-out- 
of-three multiple-choice recognition test on 
each task. The recognition test on the 
Marble Statue task contained items dealing 
with 67 distinct ideas in the story as identi- 
fied by Whipple (1915); in the five-item 
recognition test on the symbol task, the 
digits served as stimuli, and the geometric 
designs as multiple-choice alternatives. 
Within each of the subgroups thus gener- 
ated, half the Ss underwent immediate 
assessment of retention, and the other half 
were tested after a 48-hour delay. Ss were 


not told that they would later be tested 
on what they had learned and were thus 
left to assume that the experiment was 
terminated after the first session. No special 
attempt was made to prevent communica- 
tion among Ss during the 48-hour delay 
interval, but there was no evidence in 
either group that the experimental situation 
excited any appreciable amount of discus- 
sion among the students during this time. 
Although the testing was conducted by dif- 
ferent experimenters in Tel Aviv and in 
Florida, explicit and formalized procedures 
were outlined, and each experimenter ad- 
hered literally to these standardized instruc- 
tions. In order to allow use of analysis of 
variance procedures, a fixed number of Ss 
were scheduled for inclusion in each ex- 
perimental condition. Thus, four experi- 
mental subgroups, cach containing 20 males 
and 20 females, were erected within each 
cultural group: immediate recognition, im- 
mediate reconstruction, delayed recognition, 
and delayed reconstruction. 


Scoring 

In scoring reproduction of the content 
task, one point was allowed for each cor- 
rect idea included as prescribed by Whipple's 
(1915, p. 336) manual. The recognition test 
was scored by allowing one point for each 
correct answer. In the symbol task, the re- 
construction measure was scored by allowing 
two points for each correct matching of 
properly reproduced symbols and digits 
and one point for any other correct repro- 
duction of either a symbol or a set of digits. 
Since the measures of recognition and re- 
construction were not inherently compara- 
ble, no direct comparisons of the two 
scores within each cultural subgroup were 
possible. 


RESULTS AND Discussion 


Mean scores for each of the experi- 
mental subgroups on each of the 
measures of retention are reported in 
Tables 1 and 2. 

Reconstruction scores for the con- 
tent task were submitted to a three- 
dimensional (Sex of S x Cultural 
Group x Delay Interval) factorial 
analysis of variance. Scores for the 
Israeli students were significantly su- 
perior to those for the American stu- 
dents (F — 30.16, p « .01). Scores for 
both cultural groups on the immedi- 
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TABLE 1 


SUMMARY OF SCORES FOR AMERICAN AND IsRAELI STUDENTS ON IMMEDIATE AND DELAYED 
RECONSTRUCTION AND RECOGNITION MEASURES OF RECALL WITH A MEANINGFUL 
CONTENT LEARNING Task 


Males Females 
um M SD м SD 
American 
Reconstruction measures 
Immediate recall 26.85 6.35 24.85 8.70 
Delayed recall 20.20 5.70 22.05 4.85 
Recognition measures 
Immediate recall 6.55 .80 6.50 .66 
Delayed recall 5.75 .94 6.50 .66 
Israeli A 
Reconstruction measures 
Immediate recall 34.85 6.71 31.30 6.70 
Delayed recall 24.90 6.50 26.10 6.45 
Recognition measures 
Immediate recall 6.65 .57 6.90 .30 
Delayed recall 5.70 1.00 6.80 .46 


ate-recall measure were superior to 
scores on the delayed-recall measure 
(F = 33.92, p < .01). Although the 
overall simple sex factor was not sig- 
nificant, the interaction between sex 
of S and recall interval was sig- 
nificant (F = 4.12, p < .05). While 
this interaction has no bearing on the 
purpose of the investigation, it is at- 
tributable to sex differences in op- 


posite directions within each of the 
recall-interval groups: Boys were su- 
perior to girls on the immediate-re- 
construction measure but were inferior 
to girls on the delayed-reconstruction 
measure. This pattern occurred within 
both cultural groups. 

Scores on the recognition measure 
of recall for the content task, as well 
as those on both the recognition and 


TABLE 2 
Summary OF SCORES FOR AMERICAN AND ISRAELI STUDENTS ON IMMEDIATE AND DELAYED 
RECONSTRUCTION AND RECOGNITION MEASURES OF RECALL WITH A MEANINGLESS 
SYMBOL LEARNING TASK 


Males Females 
Group 
us SD M SD 
American 
Reconstruction measures 
Immediate recall 12.20 3.20 12.30 ET 
Delayed recall 4.70 1.85 7.65 BAD 
Recognition measures 
Immediate recall 4.60 .58 4.70 Д 
Delayed recall 3.30 1.65 4.30 "T 
Israeli 
Reconstruction measures 
Immediate recall 13.05 1.85 12.45 2.05 
Delayed recall 10.20 4.55 11.00 3.60 
Recognition measures 
Immediate recall 4.50 .57 4.80 40 
Delayed recall 4.30 .78 4.80 “31 
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reconstruction measures for the sym- 
bol tasks, were distributed inappro- 
priately and required nonparametric 
analysis. Using the Mann-Whitney U 
statistic (Siegel, 1956), no significant 
differences between the two cultural 
groups were revealed. Thus, the hy- 
pothesis that American students 
should perform better on recognition 
measures found no direct support. 

However, a comparison of delayed- 
recall performance measures for each 
of the cultural groups on the symbol- 
association task showed that Israeli 
students were superior to their Ameri- 
can counterparts on both reconstruc- 
tion (p < .01) and recognition (p < 
.01) scores. While it is possible that 
the Israeli students were simply in- 
tellectually superior to the American 
students, there is no basis either from 
the overall data in this study or from 
characterization of the two S pop- 
ulations to support such an as- 
sumption. Rather, it appears that stu- 
dents who do well in reconstruction 
tasks may do well also in tasks in- 
volving recognition but that the re- 
verse does not necessarily hold true. 
Assuming no real differences in actual 
potential or ability between the two 
cultural groups, these observations are 
interpreted as likely consequences of 
differences in training and experience 
with particular educational and test- 
ing practices. 

If it were possible to compare scores 
on the reconstruction and recognition 
measures directly, a more satisfactory 
direct conclusion with reference to the 
initial hypotheses of this study could 
be drawn. Unfortunately, however, 
there was no means of equating scores 
on the two measures. Furthermore, it 
was not possible to utilize standard 
score conversions for such a compari- 
son since the score distributions on 
the recognition measures were highly 
skewed away from the ceiling of the 
measure. These score distributions. 
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suggest that the recognition measure 
(which had been constructed on the 
basis of pretesting with American stu- 
dents) was perhaps too easy, resulting 
in a stacking of scores at the ceiling 
of its score range. 

Further exploration of cultural dif- 
ferences with respect to various meth- 
ods of assessing retention certainly 
seems warranted. The data presented 
here provide only tentative support 
for the initial hypotheses advanced, 
and considerable refinement of the as- 
sessment methods utilized in further 
studies will be necessary in order to 
provide definitive evidence concerning 
those hypotheses. Nevertheless, these 
data strongly suggest an effect of ex- 
periential and training factors upon 
individual differences in performance 
on various measures utilized in assess- 
ing the retention of learned material. 
The role of such “test-taking skills” 
is not confined merely to achievement 
testing but may also have serious im- 
plications with respect to cross-cul- 
tural comparisons of intelligence and 
aptitude measures. 
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EFFICIENCY IN MEASURING THE EFFECT OF 
ANXIETY UPON ACADEMIC PERFORMANCE' 


NEIL A. CARRIER 
Southern Illinois University 


AND 


DONALD О. JEWELL? 
Valparaiso University 


The extent to which academic examination performance could be pre- 
dicted from several measures of academic achievement anxiety was 
tested by 3 studies. To attain the greatest (most efficient) predictive 
utility from the shortest amount of testing time, interaction effects 
between anxiety measures were determined, expressed as cross prod- 
ucts, and included as independent variables in each multiple-regression 
equation. The dependent variable was score on a final examination in 
introductory psychology. This procedure increased predictive efficiency 
using selected anxiety measures. Prediction of examination perform- 
ance was better for female than male Ss. 


Recent research on the effect of anx- 
iety upon performance has seen the de- 
velopment of a number of specific in- 
struments for the measurement of 
anxiety related to academic achieve- 
ment performance (Alexander & Hu- 
sek, 1962; Alpert & Haber, 1960; 
Mandler & Sarason, 1952; Sarason, 
1958). Alpert and Haber (1960) have 
concluded that such scales are better 
predictors of academic performance 
than are general anxiety scales and 
seem to account for variance in such 
performance beyond that accounted 
for by aptitude measures. Preliminary 
unpublished work by the authors, pri- 
marily performed within analysis of 
variance designs, not only supported 
the first of these conclusions, but also 
suggested interactions between specific 
anxiety measures in accounting for 
variance in course examination per- 
formance. 

Since one of our objectives was to 
be able to test the accuracy of predic- 
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tions of academic performance made 
from anxiety scores, it was decided 
to employ multiple-correlation proce- 
dures in subsequent work. In multiple- 
regression analyses possible interac- 
tion effects may be determined by 
including the cross products of vari- 
ables as independent variables and 
testing the significance of their con- 
tribution to the multiple-correlation 
coefficient. If such a procedure could 
thus identify cross products which 
would increase a multiple correlation 
beyond that based upon anxiety scores 
alone, a greater “efficiency” of predic- 
tion would be possible in the sense of 
accounting for more variance with the 
same amount of testing time. In addi- 
tion to this empirical rationale, the 
position was taken that since a number 
of different types of anxiety measures 
were to be employed, each presumably 
measuring a somewhat different aspect 
or kind of anxiety, there is some com- 
mon-sense logic in assuming that cer- 
tain combinations of scores may mean 
more than the independent scores 
alone. As a consequence, three consec- 
utive studies were conducted to de- 
termine what minimum combination 
of several measures of test anxiety and 
their cross products would provide the 
best prediction of final-examination 
performance. 


g 


Srupy I 


Procedure 


The subjects were 137 students in an in- 
troductory psychology course at Southern 
Illinois University. Approximately at mid- 
quarter they took Alpert and Haber's (1960) 
19-item Achievement Anxiety Test (AAT). 
From two subsets of items “debilitating anx- 
iety” (A—) and “facilitating anxiety" (A+) 
scores were obtained. Immediately prior to 
a second hourly examination, a 33-item form 
of the Anxiety Differential (AD) was given 
(Alexander & Husek, 1962). A final examina- 
tion of 125 multiple-choice questions pro- 
vided the measure of academic performance. 
A multiple-regression analysis was carried 
out using the final-examination raw score as 
the criterion variable, and raw scores on the 
AD, both scales of the AAT, and the three 
possible cross products of these scores as 
independent variables. Each independent 
variable was tested to determine the sig- 
nificance of its contribution in accounting 
for variance in the criterion variable. The 
“best” mutiple-regression equation (largest 
multiple-correlation coefficient from the few- 
est independent variables) was sought for 
each sex and for the sexes combined, 


Results 
The Study I portion of Table 1 


shows the first-order and multiple-cor- 
relation coefficients of the three anx- 
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iety measures (alone and in various 
combinations) with the criterion meas- 
ure. The most notable feature of the 
data is the fact that adding their cross 
products to the three anxiety measures, 
as independent variables in the mul- 
tiple correlation, increased the coeffi- 
cient. This is more obvious in the case 
of the female than in the male subjects. 

These results were regarded as en- 
couraging enough to pursue a second 
study. The same measures and tech- 
niques employed in the first study were 
used to permit a check on the reliabil- 
ity of the results obtained in the first 
study. The unique aspect of the second 
study was the addition of more anxiety 
measures. 


Srupy П 
Procedure 


The following quarter a second group of 
151 subjects was tested the same way as in 
Study I. In addition, Sarason’s (1958) Test 
Anxiety Scale (TAS) was given about mid- 
quarter and scored for the usual six scales. 
After determining the contribution of each 
of the new variables and the cross products 
in accounting for variance in the criterion 
(final-examination performance), two scales 


TABLE 1 


FIRST-ORDER AND MULTIPLE CORRELATI 
FINAL-EXAMINATI 


ONS OF CERTAIN VARIABLES WITH 
on Raw Score 


Study I Study II Study III 
Correlations 
M F | M&F к |м 
= F &F 
М = 98 | = 39 N = 137 |N = 4 |N = 57 яе ез P, MET 
Шз 
.23* .22 .. 
Ab -4| -p| um EIE 82 
Multiple = —.31 | —.08 | —.19 
АРАС ар e| d on 32| ане 50% 
AEN un M 
y 9, 

A--A—AD,(A4)(A—), ^* 
(DD. ASAD) С diss 38 .05**| 51 
A-P (TÀ) (A — (TA) 

A+,A—NA(A+)(A—), 

(A-+) (NA), (A) (NAY " 
-43 .68** .56* 
өр < .05. 


** p» < 01. 


ANXIETY AND ÁCADEMIC PERFORMANCE 


of the TAS were chosen as promising: Test 
rc (TA) and Need for Achievement 


Results 


The Study II portion of Table 1 
shows selected first-order and mul- 
tiple-correlation coefficients. The re- 
sults obtained in the first study using 
the three scales (A+, A—, AD) were 
fairly generally repeated in the second 
study. Notable increases in coeffi- 
cients were apparent, however, in the 
case of the female subjects. Also, it 
was again seen that adding cross prod- 
ucts as independent variables in the 
regression equation increased the mul- 
tiple-correlation coefficient. The same 
sex difference was observed, with pre- 
diction likely to be better for females 
than for males. 

Since the best combination of three 
anxiety measures (and their cross 
products) seemed to be the A+, А-, 
апа NA scales, these were selected for 
the third study in which (a) confirma- 
tion of the results obtained here was 
to be sought; and (b) these three 
Variables (and their cross products) 
Were to be used to predict final-exam- 
mation performance. 


Srupy III 


Procedure 


TN third group of 66 subjects was admin- 
oes the AAT and the NA scale during 
ale fourth week of a quarter. The latter was 
th Ministered as a separate instrument rather 
wa: a subset of items of the TAS. This 
in Wi one in the interest of economy of time 
" esting. The AAT and NA scale were 
took ered together, in that order, and 
do m a maximum of 15 minutes for subjects 
баш el: The basic objective was to 
tie for as much of the variance in final- 
iet nation performance as possible with as 
an assessment of anxiety as possible. 
elonet multiple-regression equations de- 
second for each sex from the data of the 
Scores study, predicted final-examination 
obtai were calculated using the six scores 
ined from the A+, A—, NA, and their 
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three cross products. At the end of the quarter 
the actual score on the 125-item multiple- 
choice final examination was obtained. The 
first-order correlations of predicted with ac- 
tual score were .32 for males and .52 for fe- 
males. The size of these coefficients is en- 
couraging when it is remembered that the 
data upon which they were based were ob- 
tained from only 15 minutes of nonintellec- 
tive testing, and that the predictive equations 
were based upon data obtained from a rela- 
tively small sample of the student popula- 
tion. 

The third portion of Table 1 shows the 
first-order and multiple correlations on the 
data collected in the third study. The results 
from the second study were generally sup- 
ported, but again the size of the female sub- 
jects’ correlations showed an apparent in- 
crease and were substantially higher than 
those of the male subjects. The contribution 
to the multiple correlation by the cross 
products of the three anxiety scales is less 
for females in the third study than in the 
second. This is probably due to the unex- 
pectedly high multiple correlation obtained 
using only the raw scores as independent 
variables. Since there are probably a number 
of other variables not included here that are 
related to academic performance (aptitude, 
etc.), very high multiple correlations that 
omit them probably cannot be expected. It 
appears that as the multiple correlation 
based upon these anxiety raw scores ap- 
proaches 70, the contributions of cross prod- 
ucts necessarily diminish. 


Discussion 


The data of these studies support the 
contention that academic examination 
performance can be predicted from 
scores on self-report measures of anx- 
iety. This prediction is increased when 
such scores are selectively combined in 
the form of a multiple-regression equa- 
tion, Even further efficiency may be 
obtained from the measures by the 
selective inclusion in the regression 
equation of the cross products of the 
measures. 

The present data are in disagree- 
ment with the findings of Dember, 
Nairne, and Miller (1962) which led 
them to conclude that the AAT “is not 
as useful for female as for male sub- 
jects [p. 428].” The present data sup- 


26 Ne А. CARRIER AND Рохлір О. JEWELL 


port an opposite interpretation. While 
it is not known what differences in 
sample characteristics or other vari- 
ables are responsible for this discrep- 
ancy, an unpublished study by the 
first-named author is suggestive. That 
study found a sex discrepancy in first- 
order correlations betweeen the AAT 
scales and hourly examination scores 
consistent with that reported by Dem- 
ber et al. (1962). Contrary to the 
present study, however, subjects were 
drawn from two "small" sections of a 
course in child psychology, rather than 
from the larger introductory course 
taught (and examined) in the large 
lecture format. These groups appar- 
ently resemble those tested by Dember 
et al. more than either resembles those 
of the present study.? 

In addition to the AAT and NA 
scale, the AD and TA scales appear 
to have some merit in accounting for 
examination performance. A method- 
ologieal procedure which may have 
prevented the AD scale from making 
its maximum contribution in the first 
two studies was the fact that this 
measure of “situational anxiety" was 
not given immediately before the final 
examination from which the criterion 
scores were obtained. However, it was 
given in each study in conjunction 
with a previous hourly examination, 
and earlier data gathered by the au- 
thors showed AD scores correlated 
about the same with examinations 


*The unpublished study also found the 
AAT scales (especially the A—) correlated 
higher with performance on the essay than 


with the multiple-choice questions in the 
course. 


taken immediately following its ad- 
ministration and those taken several 
weeks later. 

The findings also suggest a number 
of interesting questions. If anxiety fac- 
tors do play a considerable role in per- 
formance on academic tests, how much 
of the variance in the scores on college 
admission tests is a result of these fac- 
tors? What nonanxiety personality 
variables interact with anxiety to in- 
fluence examination performance? Do 
the effects of anxiety upon perform- 
ance vary with the type of examina- 
tion (recall, recognition, etc.) ? What 
situational conditions affect examina- 
tion performance, and can the detri- 
mental effects be reduced? The authors 
are now pursuing studies addressed to 
these questions. 
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The present study inves 
interest in the rewards of 
responsibilities clash 
sisting of 4 Likert sca 
sertion, Status Seeking, 
a variety of personal data were 
in Grades 10, 11, and 12. Aca 
grade-point average, 
4 aspects of dating for 
attractions for girls. Number 
boys’ interest in all aspects 0 
ship was reliably assoc: 
for girls with the sexua 


Ж... youth-culture literature sug- 
upon hat adolescents who lean heavily 
likel peer functions for support are 
adul to be diverted from attaining 
tion [; nm objectives of socializa- 
Wi port, 1961; Parsons, 1962; 
sso t, 1963). One might expect, 
School re that adjustment to high 
jai Bonsibilities would be af- 
adoles adversely by the extent to which 
of Wii strive for the attractions 
is hi es culture. The assumption 
aiten y general, and the present study 
етее - to evaluate empirically the 
in the т which it holds for adolescents 
Sore 11th, and 12th grades by 
тяни interest in certain specific 
Ous as ions of social dating with vari- 
БЫ мү of orientation toward high 
The E» d toward peer relations. 

adult elief that youth-culture and 
traditi oals may be incompatible stems 
ory wee from psychoanalytic the- 
Widely ich has provided a basis for the 
tion in âdopted belief that participa- 
marily the youth culture serves pri- 
belliou, as an outlet for adolescent Te- 
logo. ness (Hurlock, 1955; Keniston, 
e 1958). Analyses of 
emerged. ture activities also have 
recently through the influence 
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iated for bo 
1, aspect of dating. 


tigates the hyp 
social dating and involvement in high school 
detrimentally. 
les (Sexual Grati 
and Participative Eagerness) and requesting 
administered to 393 boys and 346 girls 
demic performance, 
was negatively associ 
sexual and in 
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othesis that relatively strong 


dating questionnaires con- 


Social 
fication, Independence As- 


as measured by 


ated with interest in all 
e-assertion 


learning theory, which 
stresses the formative influence of re- 
ward-and-punishment features of fa- 
milial and cultural functions (Ban- 
dura, 1964; Elkin & Westley, 1955; 
Parsons, 1962). This viewpoint asserts 
that adolescents may be more inter- 
ested in participating in activities of 
ulture than of families or 
he extent that its attrac- 
tions and privileges are more appeal- 
implies that adoles- 
ho seem defiantly independent 
d goals 


of social 


youth-culture cohorts. 
Hence, both psychoanalytic and social 
learning theory lead to the belief that 
extensive interest and involvement in 
youth-culture attractions may be as- 

atisfactory adjust- 


wards from 


sociated with uns adiu 
ment to high school responsibilities. 
Moreover, the social learning view 


youths' diversion from 
ay be, in part, a 
1 appeal of youth- 
and 


stresses. that 
their obligations m 
result of the powerfu 
culture rewards, satisfactions, 
comforts. 

The present investigation of youth- 
culture functions is restricted, in order 
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thatit might be reduced to manageable 
proportions, to the appeal of social 
dating. For present purposes, social 
dating is defined as encompassing all 
situations in which the principal actors 
constitute a heterosexual pair, are un- 
married, and are unchaperoned, free 
agents. Dating patterns, ranging from 
casual dating to engagement, encom- 
passed by the definition offered here 
appear to share several common at- 
tractions (cf. Method, below). The re- 
wards most easily obtained through so- 
cial dating are available in various 
contexts and in countless other con- 
figurations; as defined here, social dat- 
ing merely is one of many systems 
through which they are obtained. 
However, analysis of the dating social 
system provides an excellent realm for 
investigating the youth culture be- 
cause maturation of sexual propensi- 
ties and cultural emphases upon heter- 
osexual associations are likely to make 
the rewards of social dating especially 
compelling for most youth. 

The interpretations drawn from both 
psychoanalytic and social learning 
theory and the analysis of social dating 
presented above suggest that adoles- 
cents strongly involved in the attrac- 
tions of dating would be least dedi- 
cated to high school responsibilities. 
The rewards for social dating are 
promptly and tangibly acquired from 
peers, and their consistent attainment 
requires investment of considerable 
time; by and large, the permanent 
benefits of discharging high school ob- 
ligations, that is, preparing oneself for 
additional education and citizenship, 
entail postponement of gratification, 
intangibleness, and consumption of 
vast amounts of time. The two systems 
apparently compete for youths’ atten- 
tion, and many youths seemingly for- 
sake one for the other. Specifically, 
then, one might expect that orientation 
toward the incentives of social dating 
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would be inversely related to grade- 
point average, attention to homework, 
and vocational aspirations. Since 
school extracurricular activities are 
adult sponsored, controlled, and reflect 
essentially adult-oriented rewards, one 
would also expect an inverse relation- 
ship to prevail between the social dat- 
ing interests and participation in ex- 
tracurricular activities such as school 
clubs and councils. Further, inasmuch 
as peers are the primary administra- 
tors of youth-culture rewards, adoles- 
cents who seek the attractions of dat- 
ing should be those who have a coterie 
of friends. 


METHOD 


Subjects 


Subjects were 393 boys and 346 girls from 
the 10th, 11th, and 12th grades, drawn in 
fairly equal numbers from “college,” “gen- 
eral,” and “basic” curricular groupings of an 
urban public high school, which enrolled 
some 2,000 pupils who were widely diversi- 
fied in terms of social class, academic skills, 
and vocational aspirations. 


Questionnaire! 


A comprehensive list of the attractions 
offered by social dating was compiled pri- 
marily from the descriptive interpretations 
of sociologists, the replies of several ad- 
olescents when queried as to why they 
dated, and from various articles in news- 
Papers and teenage magazines. Analysis of 
the data from these sources led to the postu- 
lation of the following four incentive cate- 
gories: 

1. Sezual Gratification (SG). Dating offers 
sanctioned Opportunities to make physical 
contact with members of the Opposite sex, 
for example, to dance cheek to cheek, to hold 
hands at the theater, to stroll with an arm 
around the consort’s waist, to caress, to neck 


*Inasmuch as space limitations prohibited 
extensive discu: 


E ssion of the questionnaire, & 
20-page document containing complete cop- 
les of the revised forms for both sexes, 
means, standard deviations, and biserial r's 


Showing the item-criterion correlations of 
at item may be obtained by writing the 
or. 
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i Я А 
a parked car or in some secluded place, 
2. Inde А Е 
и узу р, тей 
tonomy of adul p vhich to achieve au- 
ateepted ult authorities and of generally 
on lonel standards of society. Drag racing 
to gai y streets, fudging one's age in order 
GE ear admittanee to an «tadults-only" 
sex mius dating members of the opposite 
uar b. e Th tarnished are 
bs y S chiel possibilities. Dating may also 
ре А Vehicle for deviating: from, Гал СӨ 
beliefs, practices; disdaining family political 
a жы parental strictures on curfew 
less of f sex-conduct rules, and being heed- 
class т propriety in crossing of social- 

S драва amaking, and in drinking. 
casions rud Seeking (SS). Dating offers oc- 
bers of tl associating with prestigious mem- 
consorts A opposite sex. One may prefer 
debatin who are champions or leaders in 
may ae athletics, or dramatics. Dating 

uode a vehicle for entering an “in group." 
cellent er may be sought who are ex- 
Quette, gra» els for developing social eti- 

4. Pm and sophistication. 
Hitwise Eagerness (PE). Rewards 
Pealing ion ing, dating sometimes із 8P- 
vides, for p the sake of the activity it pro- 
Teward m example, obtaining ап identifiable 
Onelines ay be of less concern than avoiding 
sponsibiliti boredom, anxiety, work-role re- 
Same-sex е4 вай functions with parents Ог 

E * 8. 

ats to the above categories provided the 
eparate an unidimensional Likert-type scale. 
Propriate [ei employing similar, SeX-AP- 
tach wale EDR; were prepared for each sex. 
ments that is comprised of items or - 
a same-se: purport to describe a choice that 
‘Wo eani араша has made between 
ion of eod: le alternatives: one, the attrac- 
esirable ial dating, and the other, either & 
tural пог nondate goal or adherence to cul- 
Zach Жен s, mores, parental strictures, etc. 
is of a d re-creates the situational dynam 
Miller mee approach-avoidance conflict 
those a The subjects (Ss) are asked 
Sach item. e the protagonist’s behavior in 
een in Me oed or not they have ever 
Brees of e situation—on the basis of five 
Summatio: agreement and disagreement. The 
9f the em of each S's responses to the items 
a total ааа scales yields for each category 
Senting me which is interpreted as repre- 
that, partic indication of S's motivation for 
“entive (Sel ar category of social dating in- 
1962), elltiz, Jahoda, Deutsch, & Cook, 


Ing 
‘On: s. ш 
structing the items, an attempt was 


made to provide simple and realistic con- 
flicts, to avoid response set by reversing the 
direction of the items, and to minimize 
arousal of self-consciousness and defensive- 
ness. Also, efforts were aimed at writing 
median items in order to maximize variance 
and, very importantly, to provide a safe- 
guard against including items that adoles- 
cents—and school officials who must sanction 
them before they can be administered— 
might regard as bizarre and/or irrelevant. 
Initially, 15 items for each scale were 
randomized for presentation in & 60-item 
booklet and were pilot tested upon two high 
school populations of 220 and 406 Ss, re- 
spectively. ‘After the second revision, each 
scale was reduced to 10 items. The final form 
of the questionnaire, therefore, comprised 40 
items. An example of boys’ Sezual Gratifica- 
tion, Independence Assertion, Status Seeking, 
and Participative Eagerness, respectively, is 


given below: 

1. A boy and girl are parked in his car 
beside the lake. She is shy but they have 
necked before. He would like to neck again, 
leaving. He tries to 
change her mind. 


2. А couple have stayed out once Or 


twice beyond the girl's curfew hour, as set 
by her parents. They are at an interesting 
party and they decide to stay longer since 
her parents don't enforce the curfew. 

3, A boy is eager to make the “honor” 


service society, but he must know more of 
the top members. Being а member will make 
der, too. He dates an inter- 


him a school lea ‹ i 
esting and attractive girl who is а member, 
just so he will be invited to the society's 


parties—thus he will get to know the mem- 
d be asked to join the society. 
4, A famous sports star 18 coming to town 


to make & publi e for one night 

only. A boy is еа 

his buddies are goin 

them. But then he finds out that a girl he 
ld like him to ask 


the four sca 

sions under 
entirely unrelated, 

scales further seemed unwar- 

ranted; moreover, he existence and nature 

з are themselves worthy 


be that particip о 1 
tion of status for girls. The differences among 
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TABLE 1 


INTERCORRELATIONS AMONG SExUAL-GnATI- 
FICATION (SG), INDEPENDENCE-ASSERTION 
(IA), SrATUs-SEEKING (SS), AND PAR- 
TICIPATIVE-EAGERNESS (PE) SCALES 
Fon Boys AND GIRLS 


Boys (N = 393) Girls (N = 346) 
Scale 
SG |1A| ss | PE | sc |a| ss |РЕ 
a |, 16 | .22 | .81^ |.32 | .21. |.a1 
B pe Hos) dé | 2 178%] 124 135 
SS 58% | 112 1645 |.30 
PE E 106^ 


^ Diagonal entries are Hoyt reliability values. 


the reliability measures reported in the di- 
agonals of Table 1 reflect in part differences 
among the items for each of the scales. The 
SS items for example, are relatively more 
dissimilar than the SG items, and the phe- 
nomenon probably mirrors the roal-life 
Scene; for example, Ss are less likely to 
diffuse the focus of sexual symbols and in- 
clinations than of status symbols. 


Personal Information 


The personal information items were 
grouped into three categories: validity, in- 
cluding such items as frequency of dating, 
curfew hour, nights 
‹ ч ss to car; academic, 
including grades received in school, hours 
demic degree aspira- 
› including number of 
close friends, participation in high school 


lowed with a list of appropriate responses, 
to be checked. 


Procedure 


Ss were administered the questionnaire 
and personal information items in their class- 
rooms by one of four experimenters, none 
of whom was a teacher. Ss were asked to 
raise their hands upon completing the ques- 
tionnaire; then, as each S finished, he was 
handed the personal information items, and 
his questionnaire was collected. The two parts 
together consumed approximately 30 min- 
utes. Only the most general instructions were 
given to Ss. They were told that all the 
data would be the confidential property of 
the university researchers; moreover, Ss were 
explicitly told not to put their names on the 
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papers. However, in order that the two sets 
might be matched, they were asked to use 
their birth dates. Serious cooperation on the 
part of Ss was stressed. All Ss were coopera- 
tive, and most appeared to be highly in- 
terested in participating in the project. 


Statistical Analysis 


Major statistical evaluation in this study 
was accomplished through use of à two-way 
analysis of variance design. Unequal fre- 
quencies in the subclasses necessitated the 
use of a special procedure described by 
Scheffé (1959). Each of the four scales of 
the questionnaire was treated as a depend- 
ent variable, and the personal information 


items were employed as classificatory factors 
in a univariate analysis of variance. Three 
sets of analyses were computed for each sex. 
Frequency of dating was cross-partitioned 
with each of several validity measures; addi- 
tionally, frequency of dating and grade in 
School, respectively, were cross-partitioned 
with each of the academic and peer-relations 
variables. 


Resutts AND Discussion 


The results presented in Tables 2 
and 3 include only main effects since 
none of the interactions with the fre- 
quency-of-dating and grade-in-school 
classifications were significant. Fur- 
ther, a main effect for the grade-in- 
School classification, by 10th, 11th, 
and 12th grades, did not appear, and 
these data also are not reported. 

The data offered in Table 2 pertain 
mainly to construct validation. One 
would expect, for example, that adoles- 
cents expressing relatively greater in- 
terest in the attractions represented by 
the scales would participate more fre- 
quently in the dating system and 
would be more likely to possess op- 
Portunities with which to pursue the 
rewards. The findings of Table 2 con- 
firm the expectation in respect to fre- 
quency of dating. Mean scores on the 
Scales Increase as dating becomes more 
active. The main effect is significant for 
boys on the four scales, that is, SG (P 
< 001), ТА (р < 001), SS (p « .05), 
and PE (5 < 001); for girls on the 
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TABLE 2 


Mean SCORES AND RESULTS OF ANALYSES OF VARIANCE FOR SEXUAL GRATIFICATION (SG), 
INDEPENDENCE ÁssERTION (IA), Sratus SEEKING (SS), AND PARTICIPATIVE EAGERNESS 
(PE) iN Terms or FREQUENCY or DATING AND OTHER Aspects OF SocIAL DATING 


Boys Girls 
Antecedent 
N SG IA SS PE N 56 ТА 55 РЕ 
Frequency of dating 
No dates yet 81|27.4 24.7 24.9 |27.9 4825.4 25.5 23.7|25.3 
1 per wk. 123/32.0 [29.0 [26.1 399.1 9628.0 — (25.8  |23.9/27.3 
1 per mo. 10634. [81.0 [27.2 31.0 9729.8 [27.0  |23.6|26.4 
3 4- per wk. 83]36.7 32.5 27.0 |32.4 105/32.8 29.7 24. 1/27.3 
F ratio 27.46***|24.30***| 3.50*|10.68***| — |14.11***| 5.86***| ns | ns 
Age dating began 
13 or younger 8535.9 [32.5 |28.1 92.4 4531.6 [28.4 |23.126.3 
14 99134.2 31.3 26.4 [30.4 1061380.2 28.2 [24.127.9 
15 170340 |98 [266 |30.2 [101302 |270 023.3/26.5 
16 or older 5132.4 |28.1 [25.9 330.0 4629.8 [26.6  |23.0120.7 
F ratio ns 4.95** | ns ns ns ns ns | ns 
Curfew " 
12 р.м. 1933.8 [29.5 (27.1 [305 190300 271 23.827.0 
2 A.M. 7235.3 131.3 |25.6 |30.6 5032.3 [297 [28.0278 
Апуйше 121/34.4 182.8 [27.1 [31.4 5829.4 27.1, 23.320.7 
F ratio ns 5.20** | ns ns ns 3.50 ns | ns 
Nights allowed out 
geris 9 [тт [25.4283 
6031.7 27.7 |26.3 28.9 69/30. 97. .4|28. 
2-3 177|34 6 31.2 27.1 |30.8 206/30.9 28.0 23.6/27.1 
4 or m 15136.2 33.2. [27.1 |825 | 2382.4 072 22.7|30.2 
F ms 4.99** | 9.58***| ns 4.46* ns ns ns | ns 
Car ilabili - 
Fr d 13055.1  |32.4 26.7 [30.5 е n p Le rs 
Sometimes 119340 180.4 [26.7 [31.3 } А 21.420. 
Seldom/never 6332.9 128.0 [26.9 [30.8 4420.2 20.3 [23.7 26.4 
F ratio as  11.04***| ms ns ns ns |ms|ms 
«р: < 05. 
же « OL 
*** 5 < .001. 


SG (p < .001) and ІА (p < .001) 


adolescent girls as being reluctant to 


i i ing 1 Ipably incon- 

scales, It is noteworthy that the engage m dating is pa 

Status-Secking and Participative- gruent with the facts of contemporary 
life, and it would seem that the appro- 


Eagerness scales yielded nonsignifi- 
cant main effects for adolescent girls 
on the frequency-of-dating classifica- 
tion. Girls who actively date and girls 
who seldom date appear either equally 
eager or equally disinclined to pursue 
the rewards of the dating system. The 
image of the present population of 


priate interpretation is that all girls, 
whether they date frequently, are 
eager to participate in the system and 
to seek the peer status it proffers. 

The findings presented in Table 2 
offer scant support for the notion that 
opportunity to acquire the attractions 
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of dating is associated with enhanced 
interest in them. Possession of freedom 
to participate in the dating system, 
however, appears to exert a differential 
effect upon the two sexes. In the in- 
stance of every classificatory variable 
shown in Table 2, the main effect 
on the Independence-Assertion scale is 
significant. Inspection of the means 
shows that given more or less unre- 
strained liberty, boys rather than girls 
seem more likely to assert their in- 
dividuality in the context of the dating 
system. It appears also that boys’ zeal 
for participating in dating and for 
seeking physical gratifications are as- 
sociated with the number of evenings 
they are allowed out per week, but not 
by whether they enjoy an early or late 
curfew or by whether they have access 
to a car. Of what avail, after all, are 
a late curfew and a car if opportunities 
to capitalize upon these advantages are 
highly limited? On the whole, girls who 
have relatively greater autonomy and 
opportunity would also be expected to 
be especially interested in the attrac- 
tions of dating since, by and large, the 
inducements are there for the taking. 
As in the instance of frequency of dat- 
ing, however, impositions and restric- 
tions notwithstanding, girls appear 
equally zealous to acquire the rewards 
of the dating system. 

The belief that youths’ enthusiasm 
for the incentives of Social dating 
would be associated inversely with 
academic performance is supported by 
the data presented in Table 3, Part I. 
The means based upon grade-point 
average show that high scores on the 
dating scales tend to be associated 
with low grades. The main effect is 
significant for boys on each of the four 
scales (SG, р < .05; IA, p « .05; SS, 
P < .01; PE, p < .001), and for girls 
on the SG (p < .05) and IA (p « .05) 
scales. Two measures associated with 
academic performance corroborate the 


interpretation. The data of Table 3, 
Part I, also show that the means of the 
dating scales that showed significant 
main effects decreased for both sexes 
as study time and aspirations in- 
creased. 

The data presented in Table 3, Part 
II, suggest, as expected, that an in- 
verse relationship exists between par- 
ticipating in adult-sponsored extra- 
curricular activities and certain of the 
social dating attractions. For the ex- 
tracurricular-participation classifica- 
tion, involving school clubs and coun- 
cils, significant main effects appeared 
for boys on the SS (p < .05) and PE 
(p « .001) scales, and for girls, on the 
IA (p « .05) scale. Also, the expecta- 
tion that adolescents’ interests in dat- 
ing may be associated with peer inter- 
action, on the assumption that peers 
are the primary administrators of 
youth-culture rewards, was largely 
confirmed. On the number-of-friends 
classification, the analyses of variance 
yielded a significant, main effect for 
boys on each of the four scales (SG, 
p < .001; IA, р < .001; SS, p < .001; 
РЕ, р < .05). The mean Scores for each 
scale increased with the possession of 
a greater number of friends. Tests for 
the number-of-friends measure were 
nonsignificant for girls, However, when 
the girls who reported having three or 
more friends are separated in terms of 
whether or not they share secrets with 
one another, following Hollingshead’s 
(1949) analysis of clique behavior, the 
expectation receives some support. The 
main effects are significant for the SG 
(p < .05) and SS (p < .001) scales, 
and the discrepancies between the 
means indicate that the girls with 
close-knit peer relations scored higher 
on the scales. This comparison holds 
ге for boys on the SS scale (p € 

To the extent that the variables em- 
ployed in the present study discrimi- 
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TABLE 3 
MEAN SCORES AND RESULTS OF ANALYSES OF VARIANCE FOR SEXUAL GRATIFICATION (SG), 
INDEPENDENCE ASSERTION (IA), Status SEEKING (SS), AND PARTICIPATIVE EAGERNESS 
(PE) Іх TERMS or ACADEMIC AND PrrEn-RELATIONS VARIABLES 


Boys Girls 
Antecedent 
у 56 2А ss PE | N SG IA SS РЕ 
1. aain variables 
rade-] ve 
пон вани пиа leza [а o 291 |иб1з раз раз 261 
B 99 (31:8 sez [59 [281 91|9.9 |1 [32 [270 
c 197 329 от |5 [29 |0215 29.1 41 278 
D 70 Bio |i pea 9x2, 37200, 041,48 [20.2 
i P tio " 2.62% | 6.82° | 4.16** | 5.54 3.01* | 3.40 ns ns 
rs. of study at home per wk. 
3-4 a E 154 [34.5 4 6.8 — |32.0 74 (32.5 30.4 24.1 28.2 
1-6 141 [31.7 9 9 [28.0 132 0.3 gra Bo 268 
1+ 98 [30.3 5 з $5 | 140 [28.5 25.0. 23. $e 
PL ratio 10.81***|10.89***| па  |18.54 8.46***[13. ns |3. 
Aspirations " 
High school degree 92 |33.5 7 [тз [1.4 76 |32.8 mi pa 28.3 
Vocational school 89 [32.8 1 207 [аз 95 802 з 10 т 
College 212 |32.0 9 25.9 29.2 175 |2 Т вен reas po E s 
F ratio ns 09* пз 2.99 10. А А 
TL, Peor-relations variables 
Е согтеја bles. 
Mracurricular participation |... lagg зз [ara SLT 08 20.5 4 22.5 
SEQ 188 522 [6 03 01 C ea pes. 12 
"В erate” 120 pA, [| ms [dio | 8.38" ns |3.23* | ne 
о. frie: 
oaa ві [20.6 [1 [252 28.0 g ge 27.0 23.9 
1-2 136 |32:2 201 [56 [20.7 йш BD D 
Hn 108 22.0 aCe 253 Ani 0102 [26.0 [23.1 
UP ratio 98 39.0 eeel 9:35***| 5.88%") 3-41* ns ns ns 
eu i ло 
Bc wr Bl EI Bi аа B» Wi 
no 55 |32.1 жй. в: [ns 4.32* ns  |10.83***| 


EU 


ts in terms of responsibilities find solace in indulg- 
their interest in obtaining the rewards ing, to a degree, in both +з cea 
of the social dating system, the data cations and refractoriness, p an 
of the present investigation suggest for which are peer-culture а niue 
that, in addition to the negative, puni- tered. Yet, for boys at least, peer re- 
tive, frustrating aspects of school life Jations and the status-seeking and the 
that may propel adolescents away participative features of dating appear 
from school, there may exist induce- to be the more closely associated. Boys 
ments that entice them away. More- who are nonparticipants in extracur- 
Over, it appears that the various at- уісшаг activities seem especially in- 
tractions of social dating are equally terested in participating in dating and 
influential across the 10th, 11th, and in using the dating system for seeking 
12th grades. On the whole, poor 305" status. Sex differences also emerge in 
demie performance is consistently Té- ^. beet to peer relations. Having 
lated to interest in sexual gratificatior friends per se is an important correlate 
end in asserting independence. Nearly or : motivation for all the re- 
every adolescent is imbued with the of boys f dating, whereas having 
cultural imperative to find a mate, and sources О d ds ansa Чо: Б 
it seems that adolescents who have close, intimate =: dub p A e 
been sidetracked from fulfilling school the more critical variable tor giris. 


nate among adolescen 
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EFFECTS OF TEACHER AND PEER SUBCULTURES 
UPON STUDENT ASPIRATIONS 


DONALD L. THISTLETHWAITE лхо NORMAN WHEELER 
Vanderbilt. University 


A panel of 1,772 students enrolled at 140 colleges was surveyed at the 


beginning, middle, 


and end of college. Multiple-regression equations 


were developed for predicting terminal dispositions to seek advanced 


graduate or professional training 
Residual scores, used as indexes о 
related with student reports of their co 
analyses of the latter were performed. 14 of 
3 of 9 factors were significantly correlated 
while none o 
both kinds of residuals of women. The 
training was manifested by an increasing 


aspirational residuals of men, 
significantly correlated with 
disposition to seek advanced 


number of panel members with increase 


f 


from 


8 precollege characteristics. 
changes in dispositions, were cor- 
llege environments, and factor 
33 college press scales, and 
with both behavioral and 
f the scales or factors were 


d exposure to college. 


зр riou studies of the effects of col- 
kc upon aspirations to seek graduate 
Erin degrees have been 
the m to two modes of analysis. In 
k irst, which may be called a “col- 
ea ny” analysis, the under- 
unit uate institution is regarded as the 
‘ies of sampling, and an attempt 18 
of ге to describe the characteristics 
“hi иш which are unusually 
Ea or “low” with respect to their 
ad ү of students who later obtain 
mur degrees or other scholarly 
finos ds (Astin, 1961, 1962b; Knapp & 
19595) q 1952; Thistlethwaite, 19598, 
method, The most frequently expressed 
ети, ological caution in recent pro- 
мена ү studies is the necessity of 
ity E ing differences in student qual- 
Келеш. institutions so that com- 
ет ns on the basis of the attam- 

S of graduates will not merely 
ead 


"The 


омеа Qo TOR reported herein was sup- 


оу А the Cooperative Research 
States TNT the Office of Education, United 
and Wan nent of Health, Education, 
Tesearch are, In addition the early phases of 
tional SA partially supported by Na- 
омадо ee Old Dominion 

е ations os Ford Foundation grants to 

е are ас Merit Scholarship Corporation. 
Consultation eful to Richard L. Gorsuch for 

lon in the analysis of data. 
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reflect diversities in entering classes. 
While there is suggestive evidence that 
both the structural and psychological 
characteristics of college environments 
influence motivation for advanced de- 
grees, none of the positive studies have 
controlled more than a few college- 
input variables. 

The second frequently used mode of 
analysis may be called an “analysis to 
find out who changes.”2 In this 
paradigm the student is regarded as 
the unit of sampling, and an attempt 
is made to correlate individual differ- 
ences with respect to change in degree 
aspirations with actual or reported ex- 
es during college (Astin, 1962a, 


perience 
1963a, 1963b; Thistlethwaite, 1960, 
19624, 1962b, 1963). The present 


h follows the second para- 
ts to control as many as 
characteristics, and 
hypotheses concern- 
f teacher and peer 
student degree aspi- 
obes than 


study, whic 
digm, attemp 
eight precollege 

thereby to expose 
ing the effects o 
subcultures upon 
rations to more searching pr 
have yet been reported. 


2This term has been borrowed from 
Hovland, Lumsdaine, and Sheffield (1949, p. 
316), who first used it to refer to a method 
of studying the relation of film effects to 


population variables. 
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METHOD 


Sample 


Members of the sample were first identi- 
fied as juniors in secondary school when 
they participated in an aptitude test used to 
select scholarship recipients in a national 
scholarship competition. Examinees are 
known to differ from nonparticipating high 
school students in that they more frequently 
planned to enter college, more often had 
parents who wanted them to go to college, 
and more frequently took college prepara- 
tory courses (Dirks & Anderson, 1960). Only 
the 300,000 examinees having test scores 
equivalent to a national percentile rank of 64 
or higher were designated for sampling. A 
10% random sample, stratified by state, was 
drawn, and the first mail survey was sent to 
selected students in 1959, a few weeks before 
they would normally have entered college. 
Returns were received from 26,771 respond- 
ents, who comprised approximately 90% of 
the designated sample. There were propor- 
tionately fewer returns from students resid- 
ing in the South and from students with 
lower aptitude-test scores. The bulk of the 
data from the first survey was analyzed for 
a purpose which does not concern us here. 

The second mail survey was sent only to 
selected panel members who had enrolled in 
one of the 140 most popular colleges and 
universities attended by respondents to the 
initial survey. Names of respondents enroll- 
ing at these colleges were stratified by col- 
lege of enrollment, and random samples of 
60 names were selected from each stratum 
containing more than 60 names, while all 
names were selected from each stratum con- 
taining between 35 and 60 names. These 
procedures yielded a designated sample of 
7,464 examinees to whom a second mail sur- 
vey was sent in 1961, when these stude 


nts 
would normally have completed the second 
year of college. Usable returns were received 


from 58%, or 4,330, of this group. Response 
rate varied by sex and aptitude level: 
Women responded more frequently than 
men, and students with high aptitude-test 
scores responded more frequently than those 
with low test scores. 

The final survey was made in 1963, a few 
weeks before panel members would normally 
have entered graduate or professional school, 
It was limited to students who indicated in 
1961 that they were proceeding through col- 
lege in normal progression. Returns were 
received from 2,919, or 70%, of the desig- 
nated sample. Three response biases were 


present in the returns: First, high-aptitude 
students tended to respond more frequently 
than low-aptitude students; second, there 
were proportionately more returns from 
students with higher grade-point averages as 
reported at the end of the second year of 
college; third, response rate was positively 
related to the student’s degree aspirations as 
reported at the end of the second year of 
college. 

The data to be reported are based only on 
panel members who participated in each of 
these surveys, and the cumulative response 
biases clearly resulted in the selection of а 
panel more disposed to seek advanced train- 
ing than a typical sample of college gradu- 
ates. These biases precluded any attempt to 
estimate from the panel the proportion of 
all college students who continue their train- 
ing in graduate or professional schools, but 
they were not a source of invalidity in our 
attempt to test hypotheses concerning fac- 


tors influencing student motivation for such 
training. 


Criterion Questions 


Included in the final survey were two 
items designed to provide indicants of moti- 
vation to seck advanced training: The first, 
hereafter referred to as a measure of aspira- 
tions, asked the respondent to indicate the 
highest level of education he expected to 
complete (e.g., BA, MA, first professional, or 
PhD degree); the second, hereafter called 
the entry criterion, asked the respondent to 
indicate whether or not he was enrolling 
immediately in a graduate or professional 
school. Responses to the aspiration question 
Were scored with weights of 0, 1, and 2 for 
BA, MA, and professional or PhD degrees 
respectively; responses to the entry question 
were scored with weights of 0 and 1 for non- 
entry and entry, respectively. Since it usually 
was not possible for nongraduates to enter 
graduate or professional school at this time 
only data for the 1,970 panel members 
who had fulfilled all the requirements for 
graduation from college on or before Sep“ 


tember 1963 were included in the analyses tO 
be reported. 


Adjusted Criterion Measures 


Precollege Characteristics. In an earliet 
analysis of data from the first two survey? 
Sex and initial degree aspirations were the 
2 predictors (among 11 on which reasonably 
complete information was available) mo$ 
highly related to degree aspirations after 
years of college, However, subgroup} 
matched with respect to sex and initi? 
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dege aspirations but sorted according to 
aes in aspirations after 2 years of college 
Үт ered with respect to several precollege 
E aracteristics, For example, subgroups show- 
ШЕ Заста in degree aspirations tended 
SE higher aptitude-test scores and more 
"ES og applied for freshman scholarships. 
dant i mothers and fathers of these stu- 
Es oe higher levels of education and 
Es a i higher family incomes. Probable 
uite otk d of study as reported at the begin- 
P dr eee was also highly related to the 
prot bl ier degree aspirations. Ac- 
ist ngly the following precollege character- 
Slies were controlled in this analysis: 
. Sex. И 
- Degree aspirations at the beginning of 
А college (1959). 
. aa tiora] Merit Scholarship Qualifying 
lest (NMSQT) score in 1958. 
- Father's educational level. 
- Mother's educational level. 
; Number of freshman scholarship appli- 
у cations in 1959. 
" Family financial resourees as reported 
g in 1959. 
8. Probable major field of study as re- 
Sine ported in 1959. 
те) I and probable major field of study 
mitior ee classifications, data for these 
append Vere recorded as "dummy уап- 
eni m the manner described by Tatsuoka 
erm en (1963, p. 148). Complete in- 
ation on these precollege attributes was 


ue 


ons 


avaj 

ao for 1,772 of the panel_members 

1903. ad completed college by September 
Total Regression Analysis. Multiple- 


re М 
dieting n equations were developed for pre- 
Member. E criteria for the 1,772 panel 
ictors, "n rom the eight precollege pre- 
tween PN em scores (discrepancies be- 
Were ESA р and predicted criterion scores) 
ables, Th as the principal dependent vari- 
indexes of residuals may be interpreted as 
Changed сре extent to which each student 
"Dg from is motivation for advanced train- 
Status on a level predictable from his 
Standardies selected precollege variables. 
nalyses — and — Cross-Validation 
Method t me (19632) used an actuarial 
College ch, achieve similar control over pre- 
aracteristics, except that he derived 
аш from a standardization 
10n sample The applied it to a cross-valida- 
508 were ме he following additional analy- 
i valence ormed partly to investigate the 
and “nono of residuals based on “optimal” 
Optimal” regression equations and 


partly to analyze the effects of college upon 
men and women separately. 

Random samples of male and female 
panel members (475 men, 412 women) were 
drawn for standardization samples. Data for 
these respondents were used to develop 
multiple-regression equations for predicting 
both the aspiration and entry criterion from 
the precollege characteristics. (Sex was 
omitted since it was controled by per- 
forming separate analyses by sex. Dis- 
crepancies between predicted and obtained 
criterion scores were calculated and are 
hereafter referred to as standardization 
residuals, The foregoing multiple-regression 
equations were then applied to the data for 
the remaining respondents (461 men, 424 
women) who comprised the cross-validation 
sample. Errors of estimate, based upon 
applying these nonoptimal equations to sub- 
jects (Ss) in the cross-validation samples are 
referred to as nonoptimal residuals. In addi- 
tion, the data for cross-validation Ss were 
used to develop new multiple-regression 
equations for predicting each criterion. Dis- 
crepancies between predictions based upon 
the latter equations and the corresponding 
obtained criterion scores may be termed 
optimal residuals. By correlating nonoptimal 
and optimal residuals for Ss in the cross- 

it was possible to study 


validation sample, 
the extent to which the two kinds of residuals 


were equivalent. 


Independent Variables 


One class of measures of college environ- 
ments consisted of college press scales con- 
taining questionnaire items on which the 
student was asked to describe the demands, 
expectations, or activities most characteristic 
of teachers and students at his college. These 
scales may be subdivided according to 
whether they called for description of the 
student's environment as & lowerclassman 
or as an upperelassman. 

Lower-Class Press Scales. Each of these 
scales, administered to panel members at the 
end of their second year of college, con- 
tained 10 dichotomous items, 5 keyed 
so that a “True” response was weighted 
positively, and 5 keyed so that a “False” re- 
sponse Was weighted positively. The “ob- 
jects” of description were presumably col- 
legewide, since students were not instructed 
to limit their reports to their own teachers 
or to their close friends. Unlike Stern's 
(1958) scales, each of these scales was 
devised so that it called for descriptions of 
faculty or students, but never both. 

Upper-Class Press Scales. Each of these 
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TABLE 1 
CorrEGE Press SCALES 


Scales and representative items 


Faculty press (lower-class scales) 


1-1 
1-2 
1-3 
14 
1-5 


1-8 
1-9 


Student press (lower- 


1-10 
1-11 


1-12 
1-18 
1-14 


2-1 
2-2 


Affiliation. (The faculty keeps pretty much to itself 
students.) F 


Directiveness. (Assignments are usually clear and Specific, making it easy for 
students to plan their studies effectively.) T 


Enthusiasm. (The outstanding traits of the faculty are their dedication to and 
enthusiasm for their work.) T 


Achievement. (The professors are really skillful at getting students to work to the 
limit of their capacities.) T 


Compliance. (The faculty usually demands strict compliance with all course re- 
quirements.) T 


Supportiveness. (Teachers are sensiti 


and rarely fraternizes with 


ge offers many really practical courses designed to pre- 
t for his occupation.) T 


class seales) 
Competition, (The competition for high achievement is intense.) T 

stheticism, (There is a lot of interest here in Poetry, 
architecture, ete.) T 


eflectiveness, (Students here have very little interest, in the analysis of value 
e relativity of societies and ethies.) F 


music, painting, sculpture, 


performance 


Independent Thinking, (The 
and 


Students to do graduate work 
1 с d ork.) T 
Y typically dem d Ts to test questions be 
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Table 1—Continued 


Scales and representative items 


S 
tudent press (upper-class scales) 


2-13 
2-14 
2-15 
2-16 
2-17 
2-18 
2-19 


2-20 


Intellectualism. (W i 
nlism. (We sha Й xci i 
стене) п (We shared with each other the excitement of intellectual dis- 
nfavorable Self-evaluations. (M. i 
Ё alu: . (My associates w 
з to pursue their studies than I.) T n 
С ohesiveness. (It was easy to get а group o 
oe going to the movies, etc.) T 
ebelliousness. (They occasionally pl 
Mem у a y plotted some sort of escapade or rebelli 
enlie (We rarely had much time for play or recreation.) F d 
s. (Among my г iates it was i ially i i 
reni g my associates it was important socially to be in the right club or 
Opposition to Faculty Influence. (Mo 
to achieve greater direction, force, and clarity.) F 
penness to Experience. (They often went to concerts and art exhibits when they 


were available.) T 


f them together for card games, talk 


st of them felt their teachers had helped them 


Not m. P ; 
diatas The item correlating highest with each seale iss 
whether agree or disagree responses, T! 


hown in parentheses. The T or F 


espectively, received greater weight in 


Scori i i 
ing the given item. 


20 А 
eren which were included in the final 
they eu to students a few weeks before 
Or professi normally have entered graduate 
keyed ional school, consisted of 8 items, 
weighted so that “Agree” responses Were 
“Disagr positively, and 4 keyed so that 
tively Е responses were weighted posi- 
ratings s item called for Likert-type 
Tating scal egree of agreement on & 5-point 
spondents e. In completing these items re- 
ite LAM asked to describe under- 
Study with eachers in their major fie 
ir sour whom they had taken most of 
eagueg т Ж or to describe student col- 
common] whom they had associated most 
Overlap, zi While some of these scales 
Ded substantially in item content 


Bradu 
the: 


Wi 


New q 
a 
low, 


ith oe s 
most, or esponding lower-class press scales, 


i М 
mensions of college press? 


Both sets of scales (and the item in each 
scale correlating highest with total scale 
score) are shown in Table 1. One of the 
upper-class scales (2-6) had inadequate relia- 
bility and was eliminated from the analysis. 
For the lower-class scales, reliabilities (KR- 
20 coefficients) ranged from .59-.85 (median 
of .73), and the correlations between illus- 
trative items in Table 1 and their respective 
scale scores ranged from 72-92 (median of 
83). For the 19 upper-class scales analyzed, 
d from .57-.87 (median of 
Jations for illustrative 
items ranged from 64-83 (median of .72). 

College Experiences. The 1963 survey in- 
cluded questions about experiences often 
considered critical in shaping the student's 
motivation to pursue à scientific or scholarly 
career. These questions concerned the pro- 
portion of the student's close friends enter- 
ing graduate school immediately, exposure 


of reliabilities range 
44), and the corre 


them were designed to represent 
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ра lists of items in each of the 

teys used and upper-class press scales and 

posited, sc E responses have been 
with the American Documenta- 


to an honors program or to graduate-level 
courses, exposure to research experiences as 
an undergraduate, whether he had won cer- 
tain kinds of honors or awards in college, 
and his extracurricular activities. 
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RESULTS 


Changes in Plans to Seek Advanced 


Training 
Changes in 

training were § 

the panel’s responses 


plans to seek advanced 
tudied by comparing 
to the three sur- 
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veys. Figure 1 shows that at the be- 
ginning of college 5295 of the panel 
members planned to obtain graduate 
or professional degrees, after 2 years 
over 61% reported such plans, and at 
the completion of college 84% expected 
to pursue such advanced degrees. 

The right-hand portion of Figure 1 
shows the percentages of men and 
women who reported they were en- 
rolling immediately in a graduate or 
professional school. It is apparent that 
the aspiration criterion had a lower 
“difficulty level” than the entry cri- 
terion as a test of motivation for ad- 
vanced training. At the same time 
there were no men, and only two 
women, wlio "passed" the difficult (en- 
try) item while “failing” the easy (as- 
piration) item. 

The disparity between terminal as- 
pirations and behaviors was much 
greater for women than for men. Thus, 
78% of the women said they planned, 
but only 27% immediately entered, 
graduate or professional study; in con- 


trast, 89% of the men planned, and 
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Fig, 1. Changes in plans to scek graduate 
or professional degrees, 
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62% immediately entered, such study. 
Plans to enter elementary or secondary 
School teaching were probably the 
most important factor contributing to 
this sex difference. A large proportion 
of the women were education majors 
for whom 1 or 2 year's teaching ex- 
perience is customary before pursuing 
a graduate degree. Thus 21% of the 
women, but only 9% of the men, said 
they were planning to apply for ad- 
mission to a graduate or professional 
school the following year. In response 
to another question, 44% of the 
women, but only 23% of the men, said 
they planned to go to graduate or pro- 
fessional school, “but not this year,” 4 


Predictability of Plans 


Aspiration Criterion. The total re- 
gression analysis yielded a multiple 
correlation of .507, indicating that, at 
best, only about 25% of the variance 
in terminal aspiration levels can be 
accounted for on the basis of the eight 
precollege characteristics, Three pre- 
college characteristics —initial aspira- 
tion level, sex, and NMSQT score— 
accounted for virtually all of the 
predicted criterion variance, Among 
women there was a multiple correla- 
tion of .275 which shrank to .233 when 
the regression equation for the stand- 
ardization group was applied to the 
cross-validation group. Among men the 
multiple correlation was .451 which 
shrank to .402 when cross-validated. 
For both women and men the two pre- 
dictors accounting for the largest pro- 
Portions of criterion variance were iD- 


a 


* Academic disqualifications and financial 
obstacles apparently had little effect upo”? 
the Student's failure to seek advanced train- 
mg immediately, Only 2% of all nonentrants 
applied for admission and failed to receive 
at least one offer of admission, and only 3% 
of the nonentrants applied for financial вир 
Port to attend a graduate or profession® 


school but failed to receive at least one offe" 
of assistance, 
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itial aspiration level and NMSQT 
score. 

moe Criterion. The entry criterion, 
ike the aspiration criterion, was less 
page d predicted for women than 
n men. For women there was a mul- 
"a e correlation of .370, which shrinks 
o .245 when cross-validated, while for 
Wa aa was а multiple correlation 
ES 18, which shrinks to .332 when 
ar e rie The total regression 
of e yielded a multiple correlation 
Со In general the best precollege 
Hum of entry into graduate 
cem tend to be the best predictors of 
зн aspirations. The only notable 
fees id was father’s educational 
with, ‘his variable was associated 
Sn a Significant proportion of entry 
e аа in each of the separate sex 
б m while the unique contribution 
á his variable to predicting terminal 
Spirations was negligible. 


Optimal and Nonoptimal Residuals 


iui correlations between optimal 
ere nonoptimal aspiration residuals of 
or validation Ss were .987 and 973 
7 and women, respectively; the 
entry at ions between the two kinds of 
а, residuals were .973 and .970 for 
the and women, respectively. Clearly 
спай of the eross-validation tech- 
ier in the calculation of the cri- 
riall n residuals would not have mate- 
Bed changed the results. Since the 
ен regression equations controlled 
Cisel ege characteristics most pre- 
Бог Separate sex analyses are re- 
tio below only for the standardiza- 
n samples. 


Factor 
eo Structure of College Press 


0 
Tow analysis by the method of 
of comm axes, using iterated estimates 
Varima: nunalities in the diagonals and 
X orthogonal rotation, was used 


to identify the independent dimensions 
of college press represented in the re- 
maining 33 college press scales. To 
assess the stability of the emerging 
factor structures, independent factor 
analyses were performed for male and 
female respondents in the standardiza- 
tion and cross-validation samples, re- 
spectively. Ten factors were extracted 
from each of these four analyses. In 
addition, a similar factor analysis was 
performed on the intercorrelation ma- 
trix based upon scores of both men and 
women in a pooled sample containing 
both standardization and cross-valida- 
tion Ss. However, only nine factors 
were extracted in the latter analysis. 
Similarities in the factor structures 
obtained in the five factor analyses 
can best be indicated by summarizing 
some of the coefficients of congruence 
(Harman, 1960, p. 257) calculated for 
comparing each factor of one analysis 
with all the factors in another anal- 
ysis. For example, Factor 1 in the 
analysis for standardization males had 
its highest coefficients of congruence 
with the first factors in the analyses 
for cross-validation males, standard- 
ization females, and for the pooled 
sample, and parallel statements can be 
made for Factors 2 through 8. Simi- 
larly, the first factor in the analysis for 


standardization females had its highest 


coefficient of congruence with the first 


factors in the analyses for cross-vali- 
dation women and the pooled sample, 
and parallel statements may be made 
for Factors 2 through 8. When analo- 
nts of congruence were 


gous coefficier 
computed for the ninth factor, Factor 
9 in one analysis had its highest degree 


of similarity with Factor 9 in another 
analysis in four out of five of the 
comparisons. The exception: Factor 9 
for standardization males  corre- 
sponded most closely to Factor 1 for 
standardization females. The coeffi- 
cients of congruence for the matching 
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factors ranged from .73-.99 (median 
of .975). The high degree of consist- 
ency and the high coefficients of con- 
gruence indicate that the factor 
Structure is extremely stable, partic- 
ularly for the first seven factors. 
It will be recalled that some of the 
upper-class scales had item content 
similar to lower-class scales, The fol- 
lowing similar pairs of scales were in- 
cluded in the two administrations of 
press scales: faculty press for com- 
pliance (1-5, 2-11), faculty affiliation 
(1-1, 2-10), faculty enthusiasm (1-3, 
2-4), faculty press for achievement 
(1-4, 2-2), faculty press for inde- 
pendence (1-8, 2-3), student esthet- 
icism (1-11, 2-20), and student intel- 
lectualism (1-14, 2-13), However, the 
corresponding scales of each pair typ- 
ically loaded highly on different fac- 
tors. This finding was used as a basis 
for distinguishing between lower- 
and upper-class press factors,5 
Press scales descriptive of facul 


usually loaded highly on 


class 
Also, 


g (.68) ; 2-2: 
"It should be noted, h 
neither the items in the mide ae 


Were ex. 
late phases of 


Faculty press for high academic stand- 
ards (.62); 2-5: Faculty press for ad- 
vanced training (.54); 2-1: Faculty 
evaluations of ability (.51); 2-19: 
Student opposition to faculty influence 

—.49). 

The loadings Suggest a tendency to 
evaluate teachers in terms of a fairly 
global 
factor seems similar to the "skill" fac- 
tor reported by Isaacson, McKeachie, 
Milholland, Lin, Hofeller, Baerwaldt, 
and may 
in student 


: hypothesis 
is that this clustering arises from the 


faculty attri- 
butes in fact co-vary as students per- 
At present we 
have no clear evidence for a halo effect 
in the sense of Spuriously high corre- 


attributions. It 


thusiasm, achievement, and independ- 
all load highly 
r-class counter- 
(1-8, 1-4, 1-8) 
do not have high loadings on this fac- 
tter scales load 
actor. If one in- 
p as representing 
errors (halo ef- 
ecomes necessary tO 
different pattern of 
S was made in rating 
milar items 2 years 


earlier, 
Factor 2 may be called “lower-class 
umanistic-intellectual press” since 
the scales 


defining this factor include 
only lower-class Seales: 1-11: Student 
press for estheticism (.77) ; 1-4: Fac- 
ulty press for achievement (73) ; 1-7: 
“aculty press for humanism (.68); 17 
10: Student Press for competitio? 
(65); 1-8: Faculty press for inde- 
Pendence (63); 1-19: Student pres? 
for Teflectiveness (.61) ; 1-14: Student 
Press for intellectualism (.54). Thi 
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was the only factor to emerge that 
‚= defined by scales clearly deen- 

m of both faculty and students. 
m. third factor, "faculty press for 
the ate and vocationalism," was 
fie a one defined by scales descrip- 
im both lower-class and upper- 
ч environments: 1-5: Faculty 
Р SS for compliance (.76); 1-9: Fac- 
eee for vocationalism (.59) ; 2- 
Ke aculty press for compliance (.51). 
ae arg of Factor 4, “student 
m ма oe seems so clear that no 
esa need be made for its inter- 
78) o» 2-17: Student playfulness 
(58); See Student rebelliousness 
(51). d 15: Student cohesiveness 
(36) ue Student press for status 
aaah ince these scales had no coun- 
aiino! | in the lower-class scales, we 
sarily еге this factor as neces- 
environ ENS of only upper-class 

| 
quist 5 has been called “иррег- 
ч Student intellectualism," since 
bi» НБ scale оп student press 
PME aa ectualism had only negligible 
bon, Е on this factor: 2-18: Student 
t ds intellectualism (.81) ; 2-20: 
ence от ез8 for openness to experi- 
ú qne Sixth factor, “lower-class fac- 
ш lation," is clearly descriptive 
Чагы Кз дыз press, since the upper- 
пері Ч ty-affiliation scale had only 
Расці. е loadings on this factor: 1-1: 
: pad pee for affiliation (.73) ; 1- 
Facult llty supportiveness (.66); 1-3: 
ulty di enthusiasm (.58); 1-2: Fac- 
Pact ness (.87). 

peti ош» 7, “upper-class student com- 
scale: 2 ,4 25 defined by a single 
able self 4: Student press for unfavor- 
class sudo а (.43). The lower- 
Petition га ip student press for com- 
оп this ho. a negligible loading 
epret „бан confident about the in- 
een call n of Factor 8, which has 
ed “lower-class social con- 


formity." It is best defined by the fol- 
lowing scale: 1-13: Student press for 
social conformity (.35). 

In the analyses for cross-validation 

men and women, Scale 1-13 had load- 
ings of .62 and .69, respectively, on a 
similar factor. However, in the "total" 
factor analysis student press for in- 
tellectualism (1-14), faculty press for 
compliance (2-11), and student press 
for reflectiveness (1-12) had loadings 
(.32, .31, .30, respectively) almost as 
high as that for Scale 1-13. It is not 
clear what these scales have in com- 
mon. 
The ninth factor, "affiliation of fac- 
ulty in major field," was defined by 
the following scales: 2-10: Faculty 
press for affiliation (.77) ; 2-9: Avail- 
ability of faculty to students (.76). 

The emergence of two independent 
faculty-affiliation factors (Factors 6, 
9) was unexpected. This factor struc- 
ture emerged much more clearly for 
men than women, some of whom tended 
to rate teachers as if the item content, 
of Scales 2-9 and 2-10 was indistin- 
guishable from Factor 1 scales. 


College Press and Changes in Plans 


To the extent that the correlations 
between college press variables and the 
residuals are significantly different 
from zero we have evidence confirming 
the hypotheses that the indicated 
kinds of college press influence dis- 
positions to seek advanced training. 
We can best avoid overinterpretation 
of these correlations by confining our 
attention to those press variables hav- 
ing significant correlations with both 
entry and aspiration residuals. Four- 
teen of the 33 press scales were signifi- 
cantly correlated with both entry and 
aspiration residuals of standardization 
males, while none of the scales were 
significantly correlated with both 
kinds of residuals for standardization 
females. When the correlation between 
each press factor and the residuals was 
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TABLE 2 
CORRELATIONS BETWEEN COLLEGE Press VARIABLES AND RESIDUALS 


Standardization males 
= 475)" 
Press variable 
Entry residual Aspiration residual 
F-1 Excellence of faculty in major field (-09*) (.10*) 
2-8 Excellence of faculty teaching ‚12 da 
2-4 Faculty enthusiasm .10* da 
2-7 Faculty adequacy as positive role models Ше ик 
2-3 Faculty press for independent thinking 239% .14 E 
2-1 Faculty evaluations of ability 16** de 
2-19 Student opposition to faculty influence —.13** —.15* 
F-2 Lower-class humanistic-intellectual press 
1-11 Student press for estheticism 13** .09* 
1-7 Faculty press for humanism .15** .10* 
1-12 Student press for reflectiveness .13** .l4** 
1-14 Student press for intellectualism .14** .11* 
F-3 Faculty press for compliance and vocationalism (—.10*) (—.11*) 
1-9 Faculty press for vocationalism —.11* —.11* 
F-4 Student camaraderie (—.10*) (—.10*) 
F-5 Upper-class student intellectualism 
2-13 Student press for intellectualism .16** .19** 
F-6 Lower-class faculty affiliation 
1-6 Faculty supportiveness Ar .12* 
1-3 Faculty enthusiasm „10* „117 


^ Factor correlations, in parentheses, are sh 


* p < .05 (two-tailed test). 
** р < .01 (two-tailed test). 


caleulated by Dwyer's (1937) method 
for determining the factor loadings of 
additional variables, three of the fac- 
tors were significantly correlated with 
both residuals for men while none were 
significantly correlated with both re- 
siduals for women. For brevity only 
the significant press correlations for 
standardization males are shown in 
Table 2. 


College Experiences and Changes in 
Plans 


There was no sex difference with 
respect to the number of “experiences” 
significantly correlated with both en- 
try and aspiration residuals, and Table 
3 summarizes only the significant cor- 
relations for members of the total 
sample who responded to each experi- 
ence item. These results provide sup- 
port for the hypotheses that the stu- 


own only when significant. 


dent’s disposition to seek advanced 
training is strengthened by association 
with peers having high educational as- 
pirations, by teachers who provide fa- 
vorable evaluations, by undergraduate 
participation in research, by participa- 
tion in honors programs, and by win- 
ning social recognition for academic 
achievement. 


Discussion 


It is diffcult to assess the extent to 
which response biases account for the 
extraordinary growth of interest i? 
advance training shown by panel 
members during the college years: 
However, other survey results suggest 
that similar, though less marked, in^ 
creases would be observable in les® 
selective populations of undergradu- 
ates who complete college. Thus Davi? 
(1963) found that 77% of the under- 


C = G 
OLLEGE ENVIRONMENTS AND STUDENT ASPIRATIONS 45 


ConnEL. Mies 
ATIONS BETWEEN x 
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Wife et oe gd m: in college entered grad- 
А ssional school i iately 
Und, evaluations a " = 
н ше grade-point average 20** P 
qum experience 7 i 
r lel s. "Т 
Сое орва in original research —.09** —.08** 
ет D research project (e.g., senior thesis) ,10** -10** 
articipated faculty member as research assistant 10** j12** 
А speserei r summer research training program 
rov i 
onors progai government or foundation .08** -10** 
ер in honors program or** Joe 
Р раг more graduate-level courses аз ап under- 
= *»* ** 
he and honors ы Е 
mud honors = —.09** 
Sàn ag Dean's List dre i 
lected M with honors „08у. то 
Slécted i. PS Beta Kappa do^ po 
on аа ener society other than Phi Beta Kappa die Am 
arship on basis of academic ability 07" #05" 


^N 
8 for c к 
orrelations were less than 1,772 because of nonresponsi 


cerning 
«E experiences 
xa P < 05 (two-tai 
pc gl wo-tailed test). 
:01 (two-tailed test). 


Srady 
th ates he sampled at the end of 


e n 

blans bw year of college reported 
Slona] moss graduate or profes- 
(1959) su y. Gropper and Fitzpatrick 
eginnin rveyed undergraduates at the 
that Бе of the senior year and found 
training: planned to seek advanced 
| ese uni, ee three-fourths of 
hej вов reported that they made 
ced 4 ns to seek, or not to seek, 
‘taining during the under- 

esults years, 
College ts of the factor analysis of 
sin future a scales have implications 
iod e udy. We have seen that 
ti 8 Scales neh pice and upper-class 
FUN S and t ad relatively low correla- 
Actors д ended to load on different 
So press scales descriptive of 


e to specific items con- 


ties and of student peer 
lly loaded highly on dif- 
These findings suggest 
f defining the object 
llege press scales 


college facul 
groups typica 
ferent factors. 
the desirability o 
of description in co. 
so that potentially important differ- 
in college subeultures may be 
y revealed. Also these data 
suggest that the teacher and peer sub- 
cultures to which students are exposed 
are typically quite different for the 
lowerclassman and the upperclassman. 
]f it were possible to manipulate col- 
lege environments 80 that there was 
greater constancy in the college press 
positively related to increased aspira- 
tions, then we might expect colleges to 
have even greater impact upon the 
educational plans of undergraduates. 


ences 
more clearl 
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It should be noted that the magni- 

tude of the theoretical relationships 
between college press and changes in 
degree aspirations cannot be ade- 
quately estimated from the correla- 
tions in Tables 2 and 3. The observed 
correlations were undoubtedly reduced 
by the restricted range of talent of the 
panel members—response biases and 
the method of selecting the sample 
produced a relatively homogeneous 
group of talented students—and by 
the fact that no corrections were made 
for attenuation due to unreliability in 
the independent variables and particu- 
larly in the criterion measures—the 
use of residual scores entailed consid- 
erable criterion unreliability. It seems 
clear that these correlations would 
have been larger for a more hetero- 
genous group or if corrections had been 
made for attenuation. 

For the purpose of testing the hy- 
potheses it is sufficient to note whether 
the correlations are statistically sig- 
nificant. These results strongly sug- 
gest that at least some of the types 
of college press and experiences investi- 
gated caused men to change their mo- 
tivation to seek advanced training. 
The failure to confirm similar hypoth- 
eses for women is congruent with pre- 
vious studies (Thistlethwaite, 1962a, 
1962b). 

Finally, brief comment should be 
made on alternative hypotheses for ex- 
plaining these correlations, First, one 
might argue that one or more uncon- 
trolled, precollege characteristics could 
be correlated with the kinds of teach- 
ers and peer groups one will seek out 
in college, and that these character- 
istics are reflected in the measures of 
college environments. Assuming these 
characteristics are also correlated with 
the residuals, then one might account 
for the observed associations through 
an unmeasured “third” variable. Sec- 
ond, there is the possibility that some 


of the reported experiences or “meas- 
ures” of college environments may be 
indicants of a newly acquired disposi- 
tion to seek advanced training. Thus 
once a student resolves to seek ad- 
vanced training, he may take courses 
under more demanding teachers or as- 
sociate with peers who share his career 
goals. This interpretation simply re- 
verses the direction of causality we 
have assumed. Third, there is the possi- 
bility that the measures of college press 
share “method” variance with the cri- 
terion measures, since both sets of data 
were obtained from the same Ss. The 
last hypothesis is not supported by the 
results of McFee (1961), Pace (1963), 
or Saunders (1962). While we may 
claim that the number of precollege 
characteristics controlled in the pres- 
ent study exceeds that controlled in 
any previous study, it was still not 
possible to rule out all rival hypoth- 
eses. This condition does not invalidate 
the tests of hypotheses; rather it 
makes these tests less probing than 
might have been the case under ideal 
conditions (e.g., if we had available ап 
experimental college in which we coul 
randomly assign Ss to different student 
and faculty subcultures). We find it 
reassuring that a variety of causal 
hypotheses concerning the effects O 
college upon the male student's dis 
positions were exposed to disconfirm- 
tion by a fairly searching probe an 
Were not disconfirmed. 
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EVALUATION OF FIVE METHODS OF TEACHING 
CONCEPTS’ 


DONALD M. JOHNSON anp R. PAUL STRATTON 
Michigan State University 


Most laboratory studies of concept learning employ a classification 
method for training and for testing, but teachers employ other 
methods as well: defining the concept, using it in a sentence, and 
giving synonyms. The 4 methods and a mixed method were evaluated 
by comparable training and testing. All single methods were about 
equally effective; the mixed method was best. Transfer from train- 
ing by 1 method to tests corresponding to other methods was 100%. 


Most concept experiments require 
the subject (S) to learn to classify ob- 
jects by practice with positive and 
negative instances and to label the 
positive instances with a nonsense 
syllable. Theoretical discussions often 
take this procedure as typical of con- 
cept learning. Carroll (1964) has ques- 
tioned the relevance of such experi- 
ments to concept learning in school 
because the label is usually a standard 
word with a network of associations, 
and because teachers use several 
methods, some of which are more de- 
ductive than inductive. Serra (1953) 
was able to list a variety of methods 
by which teachers introduce concepts 
in elementary school, and inspection 
of a college textbook will also dis- 
close a diversity of methods. There are 
at least three, in addition to the clas- 
sification method, that have been 
frequently used by teachers and text- 
books, but have not been experimen- 
tally evaluated: defining a concept, 
using it in a sentence, and giving syno- 
nyms. 

Experimental evaluation of these 
methods requires different tests of con- 
cept achievement appropriate to each, 
and different types of test item have 


1 Supported in part by a grant from the 
National Science Foundation for undergrad- 
uate research participation. Thanks are due 
to Henry Askew for expert assistance in 
scoring. 
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been devised for regular testing of con- 
cept achievement in school also. Cron- 
bach (1942) described several types of 
test item for diagnostic vocabulary 
testing. Hastings (1941) constructed 
six tests, each of a different item type, 
for testing achievement of 35 standard 
mathematics concepts and attained 
good reliabilities for each test. These 
studies have been concerned with 
blocks of related concepts within an 
area of knowledge, rather than single 
concepts, but the results suggest that 
achievement of single concepts may 
also be evaluated by different tests 
which correspond to different teaching 
methods. . 
Evaluation of different teaching 
methods raises the critical question ° 
amount of transfer. Classification 
practice may build up specific associa- 
tions that suffice for a classification 
test, but lack the generality needed for 
a definition test. However, a recent ех” 
periment with 12-year-old childre? 
(Johnson & O'Reilly, 1964) fou? 
more transfer from classification trai- 
ing to definitions, when the definition® 
were carefully scored, than previous 
research had indicated. Since a set tO 
learn concepts facilitates concep 
achievement (Reed, 1946), college SS) 
who are used to studying concept? 
may show even more transfer. The 
question can be systematically 1°” 
vestigated by giving all tests to all 68. 
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н proposes, therefore, to con- 
ms ш tests of a few concepts, to 
ie hese concepts by four corre- 
Mim ing methods and by a combina- 
АВ. € and to compare 
{адлы те l tests following all 


Ткз TS 
ST Construction AND ANALYSIS 


Nasa “йаш concepts, designated 
à preli ds, are relevant. The results of 
test at ping 4 study with a definition 
test Kans a multiple-choice synonym 
levels of diffieul indication of suitable 
dicis of difficulty, Words with Thorn- 
land ТЕ, frequency counts between 
9r colle could be expected to be easy 
between 1 students. Words with counts 
he feu 6 could be expected to 

‘ E 
ini, LORD was based on frequency 
described ility to the four tests to be 
lectives - Only nouns, verbs, and ad- 
used, Ww With four synonyms could be 
ing in н, ds with more than one mean- 
nical wo ridged dictionary and tech- 
rently p, 298 that some Ss might cur- 
the АУБ. studying were rejected. From 
editing P eae of items that survived 
Chose fo le experimenters arbitrarily 
Vile, ЖШ easy words: controversy, 
ficult 1c, and vicinity, and four dif- 
chide words: alacrity, altercation, 

› and opulent. , 


Tests 


There 
е, e-response t two tests of the open-end or 
multipl, ype and two of the objective 
š fini noice type. 
woke d ч Each word was printed with 
Cre: «eq, Short response. The instructions 
Plain brief are four words to be defined. 
h m each word means. Try 
nic, "ences Th you have to guess." 
пес іза wa е incomplete-sentence tech- 
ed qud of embedding a word in con- 
e Pic The instructions were: 
tach ed, ju incomplete sentences to be 
Sentence add a few words to give 
Ne even „а meaningful ending. Try 
Wo pital you have to guess.” There 
ences for each concept, and 


the eight sentences were pri in i 
printed in irregula: 
order. Two examples follow: “A Genio 
versy ceases when... .” 
“Something rustic would probably be found 
” 


Classification. Four short phrases were 
written to describe objects or events classifi- 
able under each of the four concepts, to- 
gether with four unrelated phrases. These 20 
items appeared in irregular order with the 
following instructions: 

Most, but not all, of the following items 
can be classified under four concepts: 
rustic, vile, controversy, vicinity. Read 
each item and mark each on your answer 
sheet thus: rustic 1, vile 2, controversy 3, 
vicinity 4, none of these 5. 

Three sample items follow: “Two men of 
differing opinions discuss the social value of 
a book." “A weathered old barn.” “An ex- 
criminal becomes a respected member of the 


community.” 
Synonyms. Thi 
test requires S to c 


s conventional vocabulary 
hoose a synonym, but 10 
choices were presented rather than the usual 
5 in order to increase the reliability of the 
single item. Four such items were constructed 
for each concept; the 16 items appeared in 
blocks of 4, one for each word, in alpha- 


betical order. 

The tests were administered in the order 
given above to reduce the opportunity for 
learning from the alternatives on the mul- 
tiple-choice tests. The two free-response tests 
were printed on separate sheets; each sheet 
was collected when completed. Time required 


was about 40 minutes. 


Subjects 
Group 1, which had the easy concepts, 
consisted of 63 students in two classes in 
general psychology at Michigan State Uni- 
had the difficult 


versity. Group 2, which 4 
concepts, consisted of 75 students in abnor- 


mal psychology. One foreign student was 
climinated from each group. 


Results 

The definitions were rated on a scale 
of 0-4 by two judges, using ordinary 
dictionary entries as criteria. The two 
sentence completions for each concept 
were treated as à unit and rated on a 


scale of 0-4 by the two judges because 
evaluation of a response seemed to be 
more stable when it was read in con- 
junction with another response related 
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to the same concept. The criteria of 
judgment were based on the instruc- 
tions to write meaningful endings and 
the assumption that Ss would try to 
demonstrate their knowledge. For both 
easy and difficult concepts, and for 
both definitions and sentences, the cor- 
relations between scorers were satis- 
factory for present purposes, ranging 
from 60-97 with a median of .78. The 
sums of the two ratings were used in 
the analysis; hence the scores for each 
test ranged from 0-8. 

The data from Classifications and 
Synonyms were treated as four-item 
tests with reliabilities computed by 
Kuder-Richardson Formula 20. For 
the easy concepts errors were infre- 
quent, and reliabilities ranged from 
.19-.72 with a median of .48. For the 
difficult concepts the range was .65- 
.89 with a median of .79. The scores 
were doubled so that the range of 
Scores on each test extended from 0-8. 


EVALUATION or TEACHING METHODS 


A comparison of methods requires 
that each be used separately. There- 
fore four sets of teaching materials or 
programs, corresponding to the four 
tests, were prepared for teaching the 
four more difficult concepts. A mixed 
program was also prepared. Mean 
scores on the four tests of the four 
difficult concepts ranged from 2.6-5.2 
as compared to a maximum of 8, hence 
there is room for improvement on each. 


Materials 


Definitions. Definitions were written, with 
the help of several dictionaries, to charac- 
terize each term in a Specific way and to 
place each in a higher-order class, An ex- 
ample follows: 

When two or more 
ent opinions, 
each other, th 
tion. Thus an 
action charac 
of opposing а 
tion of alterc 


people express differ- 
get excited, and contradict 
e event is called an alterca- 
altercation is a social inter- 
terized by heated exchange 
rguments. Now write a defini- 
ation in your own words, 


The four definitions, with spaces for the 
responses, were printed on one side of one 
sheet of paper; hence all the material re- 
mained in view. А 

Sentences. Russell and Fea (1963) point 
out that wide reading is universally recom- 
mended as a method of building vocabulary, 
but that direct research on the value of 
reading in increasing vocabulary has yielded 
disappointing results. And the reason, they 
suggest, is the practice of trying to learn 
unfamiliar words from unfamiliar context. 

For present purposes a short story of 174 
words was written in which each of the four 
words appeared twice. The instructions were 
to read the story and learn the four words. 
Following Russell and Fea, the context was 
made easy; to most college students only 
the four words were unfamiliar, Then Ss were 
given four incomplete sentences with the 
request to finish them, using each of the 
four words once. All the material remained 
in view. 

Classifications. Short descriptions of ob- 
jects and events, similar to the test materials, 
were arranged in blocks of five, one classifi- 
able under each of the four concepts, and 
one irrelevant. The four words and "none" 
were printed at the top of each block with 
instructions to classify each item by writing 
the appropriate word on a blank line next 
to the item. One such block appeared on the 
first page of a booklet, and the next page 
gave the answers to the first block, followed 
by a second block of five items. There were 
six blocks of 30 events to be classified, and 
correct answers were given to all. The ar- 
rangement of the booklet made it easy 10 
turn back and check answers. 

Synonyms. The first page of the booklet 
for the synonym training told Ss that their 
task was to learn the meanings of four con- 
cepts. Four short statements appeared next: 

Alacrity means eagerness,” “Altercation 
means squabble.” *Chide means to criticize- 
Opulent means luxuriant,” Then each of the 
four concepts appeared next to a blank space 
and four other synonyms were given, namely» 
reproach, quarrel, lavish, and promptness» 
to be matched with the four concepts bY 
Writing one synonym in each blank. The next 
Page gave the answers to these and another 
block of four Synonyms to be matched tO 
the same four concepts. Four blocks of fou” 
Synonyms were given, followed by answers 
ӘП Succeeding pages. The arrangement 0 
the booklet facilitated checking of answers: 

Mixed Program. The review by Serr? 


(1953) reported that teachers commonly 086 
Several 


methods together. Johnson and 
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OReilly ( 
ынны, fy pone found that children with 
us Base) assifying birds and defining the 
classifying more than children with only 
Pose that део It is reasonable to sup. 
equated, ^ yeu if the amount of training a 
е more Pa ia of methods me 

е Mixed 4 than any single method. 

aterials used Program was constructed of 
л i ndasa in the other four programs 
dun abridged оце The definitions 
nce in the ee Шеш appeared only 
5 а sentence. Two syn- 


9nym 
lu S and one ex, 
Stration ieee were given. An il- 


Bet | 
mean 
Moth 
Ing wi 


‘or Sas m 
Sific, A 

formul mie AE correct answers. 
ation of те ш required the 
ie om tin rules in the prepara- 
s виа ыа. The four a 
not include distinct. The definitions 
d nor eee The words in 
Беноа ра ася definitions or 
Н tations ed. In respect to sen- 
or teachin, and synonyms the 
his rule n and testing should be 
infition „definition. oes not apply to the 
oe Metho) ап ас S because memorizing & 
“ашау Mod. Е cepted part of this teach- 
bodies. od bie Program should be an 
ai S. М sentativ: В 
tui d, ыз last is С of the method it 
ror tt the attem goal that cannot be 
maxi pt was made to make 

o, Потап va effective. 
Completio hould require the same 
e fusis a ences in reading 
after в is criterion difficult to 
M Wo nts it e preliminary trials 
uld АЕ реа that each 
Students. about 12 minutes for 


S op. and testi 
Bethe; тро Qo ang were conducted in 
er win fve ae in general psy- 
an irr з of teaching materials; 
relevant, set of materials 


t, ; 
ог id neighboring Ss had 
as stopped and papers 


were collected at the —€— 
x all Ss had one “= 

he tests were given 9 da s later. 
of Ss on either day а the IE 
the five method groups to 22-28. The con- 
trol group Was increased to 40 by the addi- 
tion of Ss who were present on the test day 
but absent on the teaching day. Scoring of 
the Definitions and Sentences was carried 
out by the two judges who had attained the 
interjudge agreement mentioned above. The 
sums of the two ratings were used. 


Results 


Table 1 shows the scores obtained 
by all groups on the separate tests and 
also the totals. The maximum score on 
any test of the four concepts is 32, and 
the maximum total is 128. In respect 
to total score all method groups ex- 
ceeded the control group. These differ- 
ences are large and, as Table 2 shows, 
highly significant. 

The group that received the Mixed 
program achieved higher total scores, 
as expected, than any group that re- 
ceived а single kind of training. These 
differences are significant with the ex- 
ception of the group that had training 
on sentences, and this difference wou 


be significant by a one-tailed test. 
that received & 


Within the four groups 
single kind of training there are no 
significant differences. For this reason 

led to form & 
total of 
between this 
mixed group is 


The difference 


101.4. 
and that of the 


n 
or Se 


8, 
similar to hose for tot | 
the differences a e ж 
ups di better th e 
M d on all tests; and tl хо вош 
did better than аду. ing ym 
roup. O Definitions te x 
Eo ае тео group 1 ignifican у 
verior t other, b 6 when the 
prd oled, the mean of the 
i on p erior he con rol 
ер 5 01) and the mixed group 
gro 
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TABLE 1 из 
7. E 2 SCE EVEME 
STANDARD DEVIATIONS OBTAINED ON Four Tests or Concept ACHI 
шаа BY Sıx Groups WITH DIFFERENT TRAINING METHODS 


Achievement scores 
ii * Def. Sen. Class. Syn Total 
1 
2 3.0 28.0 27.9 99. 
Definitions 28 E Kr 5.3 5.3 22.6 
5.8 
23.6 25.2 28.4 28.7 105. 
вам и E 54 4.0 4. 17.1 
102.8 
ifieati 24 21.4 24.2 28.4 28.8 
Classifications rie ea e ds 21.0 
97.9 
25 19.9 24.3 26.1 27.6 7 
ишн ° 9.2 6.8 4.4 4.6 22.1 
101.4 
8 pooled 103 21.3 24.1 27.7 28.3 
TERR 8.5 6.2 1.4 4.6 20.8 
3.6 
Mixed program 22 26.5 27.4 29.3 30.5 E A 
6.5 5.3 3.2 2.0 ; 
Control 40 12.8 17.6 23.5 23.4 ar 
9.9 8.3 8.0 9.0 a 


is superior to the pooled group (p < 
01). The same statements can be made 
for the Sentences test and the Syno- 
nyms test. On the Classifications test 
the difference between the pooled- 
group mean and the control-group 
mean is significant (p < .01), and the 
difference between the mixed-group 
and the pooled-group mean yields a t 
of 1.56. In general it is safe to say that 
each kind of training was effective in 
raising scores on all tests above the 


TABLE 2 


SIGNIFICANCE or DIFFERENCES BETWEEN 
TOTALS SHOWN IN TABLE 1 


Method groups | Sen. | Class. Syn. | Mixed Control 
Definitions 1.23 -61 -20 | 2.95** 
entences -55 | 1.42 | 1.88 
Classifications -80 | 2.19* 
Synonyms 3.12** 
4 groups pooled 2.66** 
Mixed program 

* p < 05. 

p< ш. 


control group and that the mixed 
training was best. à 
Another way to look at the data ob 
to examine the mean scores on ea 
test of each concept. Each of the ^ 
Concept means for the pooled eo 
exeeeded the corresponding mean 16 
the control group, and 15 of et 
means for the mixed group excee 


d 
the corresponding means for the pool? 
group. 


Discussion 


The results show that construction 
of short tests of single concepts It 
feasible. The tests of the more ap 
concepts yielded 16 reliable scores е 
About 40 minutes; each score pe Ў 
knowledge of а single concept by а iye 
ferent response, and each is sensiti 
to the improvement resulting O i 
small amount of practice. The s™ ed 
standard deviations for the me 
group on Classifications and Sy” 
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nyms suggest a ceiling effect, but this 
effect was not serious enough to pre- 
nm group from exceeding the 
| Concepts of this kind сап be taught 
а college students equally well by dif- 
erent programs of instruction which 
require different responses. The supe- 
riority of the Mixed program supports 
the common practice of teachers and 
textbook writers. 

Ve question about transfer may be 
ай in this form: How much of 
үа was learned by one method trans- 
на to the tests constructed of mate- 
is I in the other methods? Since 
a s er effects are usually small, one 
mc assume that each of the four 
A p method groups would get its 
wi score on the test corresponding 
: hat method. The results of Table 1 

в not support this assumption. The 
кешр that had practice in writing 

efinitions actually was below the 
Pooled-group mean on the Definitions 

est, but above the pooled-group mean 
on Classifications. The synonyms 
group was the lowest of the four on 

Ynonyms; it was above the pooled- 
up mean only on Sentences. Over- 
tior there is no evidence of specific rela- 
А ng betwcen training method and test 

Chievement; the picture is one 0 

00% transfer. 
io interpretation for the large 
€ nsfer effect is that S sets concept 
muU ai as his goal, varying his 
ín ie on his own initiative and test- 
з imself so that the intended differ- 
ish c urs training methods van- 
oi i hus each method takes over some 

| advantages of the mixed method. 
ane her possibility is that the transfer 

urs on the testing day when S treats 


the four tests as problems and uses 
whatever he can recall to solve them. 
One with training on sentences may 
recall the story, for example, and use 
this information to formulate a defini- 
tion or to choose a synonym. Probably 
transfer occurs during both learning 
and testing. 

The matter of interference should be 
mentioned. One might suppose that if 
S is given a definition or a synonym, 
there would be no more for him to 
learn. But he is given four definitions, 
or four synonyms, as well as verbal in- 
structions; hence some interference 
can be expected. This may explain why 
the Sentences method, which depends 
on learning from context, usually con- 
sidered inefficient, was as good as any. 
The continuity of the story probably 
reduced the amount of interference. 
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IMPORTANCE OF PROBLEM HETEROGENEITY 


TO PROGRAMED INSTRUCTION' 


ROSS E. TRAUB 
McGill University 


i ifferent kinds of problems were used to provide ex- 
voce in subtasks of a more complex task. The task was 
graphical integer addition. It was taught by a learning program to 291 
6th-grade children who were divided into 3 groups matched for arith- 
metic ability. The program of 1 group contained 20 heterogeneous sub- 
task problems. Another group got 20 homogeneous problems, and a 


control group got 20 irrelevant, problems. After finishing the program, 


all students worked 26 graphical integer-addition problems. These 


were performed best by the heterogeneous-problems group, and this 
result was found to be independent of student aptitude. Other re- 
sults suggest that the heterogeneous subtask problems were better be- 
cause they reduced the probability of making stereotyped or omitted- 
response errors, 


ee 


In developing self-instructional pro- 
grams for relatively complex tasks, 
such as solving linear equations in one 
unknown or adding signed numbers, a 
programmer faces the problem of 
identifying the skills needed to per- 
form the task, Guidance in dealing 
with this problem may be obtained 
from the conceptual framework sug- 
guested by Gagné (1962). This frame- 
work consists of the analysis of a task 
into a set of subtasks, all logically 
necessary for the performance of the 
task. From this point of view, the skills 
needed to perform the task are those 
required for working the subtasks. 
Programing then involves devising 
ways to provide experience with the 
subtasks, the object being to develop 
the separate subtask skills and pro- 


* This study is based on a dissertation sub- 
mitted to the faculty of Princeton Univer- 
sity in partial fulfillment of the requirements 
for the PhD degree. The research was con- 
ducted while the author was an Educational 
Testing Service Psychometrie Fellow. Finan- 
cial support for the study was obtained from 
Educational Testing Service and Office of 
Naval Research Contracts Nonr-1858 (15), 
Harold Gulliksen, principal investigator, and 
Nonr-2214 (00), John W, French, principal 
investigator. 


mote transfer of those skills to the 
performance of the task as a whole. 

Experience with subtasks may be 
provided in at least two ways. One is 
to use instructions that tell about P 
subtasks (Gagné, 1962). An additiona 
device, commonly used in teaching, i$ 
to give practice solving problems 0 
the subtasks, 

Several studies have used problems 
to provide subtask experience (Gagne, 
Mayor, Garstens, & Paradise, 1902; 
Traub, 1963; Tuckman, 1962). 18 
general, these studies show that ex- 
perience obtained from instructions 
and solving problems results in bette? 
performance of the task as a whole 
than experience obtained from instru" 
tions alone. However, the kind of pr ob” 
lems used to provide subtask S 
Perience has not been systematically 
Studied. In particular it is not know? 
whether problem heterogeneity, that 15, 
variation in the contexts and the ye 
SWers of a set of problems, is a facto 


in learning, 


The main purpose of this experiment 
was to study problem heterogeneity 
as a factor in learning a complex tas ч 
from а self-instructional program. 10 
do this, а task was analyzed into sub 
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e On some of the subtasks, one 
aR аг subjects was given heteroge- 
cim problems to solve. These prob- 
im i HMM оз in that they 
ie Ege in their contexts and 
Pee с aspacts of their solutions. 
Dobler group solved homogeneous 
ie s, ones that were very similar 
rine: contexts and in their solutions. 
om ka group worked problems that 
a » the learning done on previous 
s ч. This group was to provide a 
dibus n with whieh to compare the 
edie groups. After learning the 
Mak cs ks, all subjects were tested on 
cee of the task as a whole. Per- 
i н of these problems was used 
di Же the effects produced by the 
pes; kinds of subtask problems. 

7: E aspect of the present experi- 
beue to investigate the relation 
subject problem heterogeneity and 
Was wh те The question asked 
ате bu her heterogeneous problems 
puo er at one ability level and 
dite eee problems at another 
ней evel. To answer this question, 
mam a n several aptitude and achieve- 
TA —)Á were used to test 
fera nteraction between level of sub- 
ptitude and the effects produced 


y ; d 
Do" different kinds of subtask prob- 


METHOD 


The ; 
mani mii Task and the Experi- 
* al Variable 
h 
addition ss used in this study was graphical 
тарса]. positive and negative integers. 
arrow o addition involves drawing an 
the additi the number line to demonstrate 
imm of two numbers. This is illus- 
starts at the ion 1. Notice that the arrow 
the first i es location on the number line 0 
> rection prs to be added. The length and 
EBET во Pas arrow represent the secon: 
trow locat at the stopping point of the 
M The Seach the sum of the two numbers. 
nto MM addition task was analyzed 
аз writter s, and a program of instructions 
n by following the suggestions of 


(7) + (3) = ---------- 


c эч 
0707975 473300093 3 4 $ 8 7 8 9 10 


SER 
—— 


| Fic. 1. A solved graphical integer-addi- 
tion problem. 


Gagné (1962, p. 357). The program had four 
parts. The first part taught how to locate 
positive integers on the number line and 
how to draw arrows to represent positive 
integers. Part 2 of the program provided 
similar training for negative integers. 

The third part of the program taught how 
to draw an arrow of specified length from the 
number-line location of a specified integer. 
This training involved two subtasks, one for 


arrows drawn in а positive direction, the 


other for arrows drawn in & negative direc- 
tion. These two su i 

the elements of the graphi 
but they did not jnvolve addition problems 
like the one presented in Figure 1. Instead, 
these subtasks taught how to draw arrows in 
response to explicit instructions. In effect 
the instructions told the student to place his 
pencil at а certain point on the number line 
and draw the arrow representing а given 
integer. Then the student completed the task 
by identifying the integer found at the end 


of the arrow. 

Three different ki 
devised for the subtas 
part of ihe program. 
lems constituted the ex 
and may be described as follows. 

Heterogeneous Problems. One set of 20 
problems Was made heterogeneous in two 
respects. First, the contexts of the problems 
were varied by providing а different 20-unit 
segment О: er line with each prob- 
lem. Second, the heterogeneous problems 
were constructed 50 that their answers varied 
in certain ways f blem to the 
next. Specifically, the integer at the start of 


an arrow drawn t 


problem cou | 
ize than the integer 


arrow. Al 
arrow could ha 
the integer at the e 
signs could be different. 
heterogeneous problems 
the number-line segments an 
Part A of Figure 2. 

Homogeneous Problems. A second set of 
as homogeneous. The contexts 


20 problems Ww! 
of these problems were held constant by 


providing the same segment of the number 


nds of problems were 
ks included in the third 
These different prob- 
perimental variable 
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Fic. 2. Number-line segments and correct 
arrow responses for three heterogeneous sub- 
task problems (Part A) and one homo- 
geneous subtask problem (Part B). 


line with each problem. Also the answers to 
the homogeneous problems were very simi- 
lar. The integer at the start of the arrow 
drawn for a homogeneous problem was 
always smaller in absolute size than the 
integer at the end of the arrow, and the 
starting and ending integers always had the 
same algebraic sign. The number-line seg- 
ment and the arrow of a homogeneous prob- 
lem are illustrated in Part B of Figure 2. 
Review Problems. A third set of 20 prob- 
lems was not directly related to the skills 
taught in the third part of the program. 
These problems gave practice in drawing 
arrows to represent integers. That is, these 
Were review problems giving additional prac- 


tice of the skills learned in the first two parts 
of the program. 


The fourth part of the learning program 
taught the graphical addition task and pre- 
sented 26 graphical addition problems. These 
problems were designed to cover a wide 
range of difficulty. They consisted of one- 
and two-digit integers presented for addi- 
tion in the format illustrated in Figure 1, 
Each problem required two answers: the 
graphical one of an arrow drawn on the num- 
ber line, and the integer answer, 


Subjects 


The subjects were 294 sixth 
who were naive about graphic; 
tion. Of the total sample, 
came from a public school di 
ton, New Jersey. The remain: 
(76 boys) came from anoth 
district located near Princi 
of the sixth-grade students 
districts were included in 
students who were exclu 


-grade children 
al integer addi- 
167 (93 boys) 
strict in Prince- 
ing 127 students 
er publie school 
eton. All but 57 
of the two school 
the sample. The 
ded were absent 
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from school for 1 or more days of the experi- 
ment. 


Procedure 


The learning program was gutes А 
the form of a 150-page booklet. To varl; ins 
program, each student read a page 4 then 
booklet, made a written response, an back 
checked his answer by looking on the se 
of the page. Wrong answers were eiie 
before the student continued. This was E 
procedure for the first 130 pages of ше Я 26 
gram. The last 20 pages were for qos 
graphical addition problems. Students 2 Rn 
not check their answers to these prob nn 

As was previously indicated, the progr 
had four parts. It was administered por 
su sive days, one part being given cn 
day. On the first day, instructions for у 
ing the program were read. These ins d 
tions were similar to those used by peni 
et al. (1962). Then the students worked i 
l of the program. When cach student eae 
finished, he recorded a number pM 
how long he had taken to complete the ris 
part. (This was also done for the other i 
of the program as they were sump 
On the second day of the experiment, is 
2 of the program was administered. Par 
and 2 were the same for all students. кой 

For the third part of the program, ЖОГ s 
on Day 3, the total sample of children. à 
divided into four sections on the Б ие 
their sex and school district. Each of Mm 
four sections was further divided at ran ibe 
so that the students in each of three aie 
sections had the same range of a ce 
aptitude as measured by the Arithmetic КШ. 
cepts Test of the Iowa Tests of Basic Sk n 
(The children were first divided by neum 
school district because of pronounced di the 
ences in the distributions of scores ОЛ p 
Arithmetic Concepts Test. This test was vite 
to equate the sample subsections for ati ri 
metic aptitude because in a pilot Pe 
ment a correlation of 74 was obtained 
tween the Arithmetic Concepts Test an 
test of integer addition.) of 

It will now be useful to talk in tet ole 
three experimental groups constructed aS by 
lows: Each of the four sections forme Mer 
dividing the students on the basis of sex hree 
school district contributed one of its B 
subsections to each of the three experimen 


groups. Thus an experimental group both 
made up of students of each sex from 
school districts, the 


© experimental groups were given put 
Same program of instructions on Day p i. 
each got different subtask problems to W 
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О 
a pee got the 20 heterogeneous prob- 
an her the 20 homogeneous problems, 
Ше third the 20 review problems. | 
кз BA e part of the program was 
аи ов the fourth day. It was the same 
eub oe he students and contained the 26 
Te integer-addition problems. Re- 
dm 2.5 these problems were scored for 
a r of correct arrow answers and the 
erui vim correct integer answers. Also, the 
зл. EM wore qualitatively analyzed and 


RESULTS 


The E 
йе of Problem Heterogeneity 
raphical Integer Addition 


The main results of this study are 
реш in Table 1 which Es the 
monsites ^" standard deviations for two 
experir es of the performance of the 
Ыл Adae on the graphical 
that oW e problems. It is clear 
UM | best performance was 
lems gn by the heterogeneous-prob- 
teger group: Both the arrow and in- 
then + of this group were larger 
homo he corresponding means of the 
ee geneous- and review-problems 
Groups. Also evident from Table 1 is 
lems act that the homogeneous-prob- 
bor to d was not consistently Su- 

һе fo о the review-problems group: 
While Él had a larger arrow mean 
mem. he latter had a larger integer 
ian results were analyzed by one- 
amon; lg of variance. Differences 
ignite he experimental groups were 
era for both the arrow-response 
Тезрбиве (p < .01) and the integer- 
tional e measure (p < .05). Addi- 

euls analyses using the Newman- 

Mine сейше for group comparisons 

at the 1962, p. 80) demonstrated 

e A inet and integer means 0 
Were gj erogencous-problems group 

an rap mt greater (p's € - 5 
Other € corresponding means of the 
and im groups. However, the arrow 
Ous-prob[or means of the homogene- 

ems group and the review- 


TABLE 1 
M AND STANDARD DEVIATIONS OF 
RROW AND IxTEGER SCORES ron EACH 
EXPERIMENTAL GROUP 


Experi Arrow 
xperimental group | № Oe SD pattern SD 
Heterogeneous 
problems 97| 11.98] 8.78] 14.817.42 
Homogeneous jij 
problems 93| 8.85| 7.80| 11.907. 04 
Review 
problems 104| 7.11| 8.34| 12.30/7.86 


problems group were not reliably dif- 
ferent. 


The Interaction between Problem Het- 
erogeneity and Subject Aptitude 

as found of a reliable 
interaction between the effects of prob- 
lem heterogeneity on graphical integer 
addition and subject aptitude. The 
test for an interaction was made in two 
ways. First а two-way analysis of 
variance design was used in which the 
different experimental groups provided 
the categories for one main effect, and 
several levels of arithmetic aptitude as 
measured by the Arithmetic Concepts 
Test provided the categories for the 
second main effect (see the Treatments 
x Levels design of Lindquist, 1953, pp. 
127-132). Eight such analyses were 
carried out, one for the arrow Scores 
for the integer scores from 


No evidence W 


and one 

each of four sections of the total 

sample. The four sections Were ob- 
the subjects on the 


tained by dividing 
basis of sex and sc 
one of the eight analyses O 
had à significant interaction 
a 

2 Analysis of 


hool district.? Only 
f variance 


term (p 


variance was performed sepa- 
rately on the data from each of the four 
sample sections because the sections con- 
tained unequal numbers of subjects and 
covered different levels of arithmetic ability. 
These conditions made it impossible to per- 
form а higher-way analysis of variance in- 
eluding Sex and school district as main 


effects. 
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< .05). It was interpreted as being 
caused by chance factors. _ : 
'The second test for an interaction 
employed analysis of covariance (Gul- 
liksen & Wilks, 1950). Two analyses 
were made, one with the arrow scores, 
the other with the integer scores as the 
dependent variable. For both these 
analyses, the independent variables 
were scores on several aptitude and 
achievement tests that were admin- 
istered to the students as part of a 
more detailed study (Traub, 1964). 
The results confirmed the conclusion 
derived from the two-way analyses of 
variance: The heterogeneous subtask 
problems produced significantly better 
performance of the graphical integer- 


addition task independently of subject 
aptitude. 


Other Results 


Classes of Arrow-Response Errors. 
Three broad classes of errors were 
identified in the arrow responses to the 
graphieal integer-addition problems. 
These were Stereotyped, random, and 
omitted-response errors. Stereotyped 
errors had certain well-defined prop- 
erties in relation to the problems they 
Were to answer. For example, one ster- 
eotyped error had the arrow Starting 
correctly at the location on the number 


TABLE 2 
PROPORTION or Arrows CLASSIFIED AS 
SrEREOTYPED, RANDOM, AND OMITTED- 
Response Errors rog EACH 
EXPERIMENTAL GROUP 


Error category 
Experimental group [НН ШИШ Ре Total 
tereo- itted- 
joel Random peus 
Heterogeneous 
problems .210 | .221 | .108 | .539 
Homogeneous 
problems -253 | .164 | .242 | .659 
Review 
problems .388 | .185 | .156 


TABLE 3 
Means AND STANDARD DEVIATIONS FOR 
Each EXPERIMENTAL GROUP ON Twe 
MEASURES OF PERFORMANCE ON PART 
OF THE LEARNING PROGRAM 


Errors on Working time 
subtask problems! (in min.) 
Experimental group 
M SD M SD 
Heterogeneous , 
problems 3.43 | 3.94 | 34.53] 8.78 
Homogeneous 
problems 2.13 | 3.52 | 29.35) 8.94 
Review 
problems ло | .32 | 24.22) 6.39 


line of the first of two integers being 
added. Instead of representing the sec- 
ond integer, these arrows were drawn 
to the location of the second integer. 
There were six other clearly identifi- 
able kinds of stereotyped errors. The 
second class of errors was random; 
they could not be related in any way 
to the problem being answered. The 
third error type was the omitted re- 
sponse in which no arrow was drawn. 
Table 2 gives the proportions of arrow 
answers classified in the three erro! 
categories for each experimental group. 

The most interesting result in Table 
2 involves the heterogeneous-problems 
group. This group made the lowest 
proportion of stereotyped and omitted- 
response errors, and the highest pro- 
portion of random errors. 

The Effect of Heterogeneity on thé 
Difficulty of Subtask Problems. Table 
3 presents means and standard devia- 
tions for each experimental group 00 
two measures of performance on Part 
3 of the learning program, These meas- 
ures were the number of errors made 
in working the 20 subtask problems 
and the time required for Part 3 of the 
program. It is clear from Table 3 that 
the heterogeneous-problems group 
committed more errors and required 
longer working time than the other two 
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experimental groups. The homoge- 
Hrs aid mn group had the second 
is ce шп Score on both measures, 
fis t the review-problems group had 
sea All differences between the 
2 wd means reported in Table 3 were 
us ш (p's < .01) when tested by 
iani ewman-Keuls. procedure. These 
ak, p group differences contrast 
Joana he nonsignificant differences 
for P among the time and error means 
it ioe 1 and 2 of the program. Thus 
E зү that the differences obtained 
E 3 of the program were pro- 
gw Ed differences in the difficulty 
The h үе kinds of subtask problems. 
dite ie м problems were most 
oat , the homogeneous problems 
метл most, difficult, and the review 
ms easiest. 


Discussion 


ы this study, students who worked 
iu Did subtask problems per- 
iex. the task significantly better 
aS шеша who worked either the 
vant geneous problems ОГ the irrele- 
this ft ai spite One reason for 
of + ape is suggested by the results 
eee analysis of responses to the 
e аы; integer-addition problems. 
oo a groups differed 1n 
could ойып of arrow responses that 
rors, S. classified as stereotyped er- 
pro та heulen; the heterogeneous- 
Proportio group made the smallest 
ls gr lon of stereotyped errors. Also, 
Portion of omitted the smallest pro- 
geneous., responses. Thus, the hetero- 
Stereot; -problems group made fewer 
More урей errors while answermg 
t „Problems. 
erogene clear that one reason the het- 
on (ineous-problems group did better 
made f nal task was because they 
Omitted ko, stereotyped errors an 
or this j ewer responses. The reason 
fous is probably that the heteroge- 
problems contained more intor- 


mation about the task than the other 
kinds of subtask problems. Support for 
this suggestion 1s provided by the find- 
ing that the heterogeneous problems 
were the most difficult. It is likely that 
difficulty has а high correlation with 
the amount of information contained 
in a set of problems. 

Additional and logically more direct 
support for the information interpreta- 
tion comes from an inspection of the 
heterogeneous subtask problems and 
the graphical integer-addition prob- 
lems. The fact is that both these sets 
of problems were heterogeneous in the 
sense of presenting problems in the 
context of several different number- 
line segments. Moreover, the different 
kinds of arrow answers required for 
the heterogeneous subtask problems 
were also required for the graphical 
integer-addition problems. 

From the foregoing it is clear that, 
while all groups had the same instruc- 
tions for solving the graphical integer- 
addition task, the heterogeneous-prob- 
lems group had more information 

in aspects of the task. The 


about certain as 
homogeneous- and review-problems 
roups di not have the information, 


groups the graph- 
addition pro lems would 
more complex. 


procedure that re- 
tereotyped error, 


esponse at all. 
findings of the 


indicate that 


when 


instructional program. The importance 
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of heterogeneous problems is addition- 
ally emphasized by the negative find- 
ing of the study. This finding was that 
the different kinds of subtask problems 
affected learning independently of 
level of subject aptitude. Heteroge- 
neous problems were best for students 
of high and low aptitude. 
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ing a learning- ry fi it w dicted that individ 
Usi n learning-theo ormulation it was р! di i 
1 ri individuals 
vill perform better on learning tasks if they are in HiCo groups (with 


highly liked others) rather than in Lo 


others). Each of 206 4th- and 5th- 


a а x e ig 
а тарот information, ір а same-sex HiC 
parisons, made on 2 verbal learning tasks, re 


Co groups (with less-liked 


е i f grade children, classed as high or low 
in IQ (California Test of Mental Maturity), was placed, on the basis 


iCo or LoCo group. Com- 
tention, and relearning, 


indieated that high-IQ children (especially girls) in HiCo groups did 


consistently better than those in LoCo 


cohesiveness made no reliable difference а 
ghtly better. Other measures are congruent 


groups tended to do sli 


with the learning-task findings. Results are discussed in te 


low-IQ children 


groups. For 
those in LoCo 


Ithough 


rms of drive 


and task complexity (competing responses). 
Sun ap Spe 


In an 
Broup 
us 

es] 
ns d has been formulated 
ory, Th reinforcement learning the- 
that th e present approach assumes 
attractiy primary source of à group’s 
attractivences to its members is the 
Oncept eness of fellow members. The 
that be Of attitude is substituted for 
ha a attraction because the former 
Teile, 28686 and particular meaning 
ae learning theory 
and dri icipatory response with cue 
and оору. properties—Doob, 1947), 
f Seas ce is then defined as 
Tom th P property which is inferred 
Mutua e number and strength of 
em Positive attitudes among the 


bers (Lott, 1961). 


i tempt to coordinate small- 
ehavior to more general be- 


principles, the concept of co- 
in 
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ащ of fre the Cooperative Research 
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mig - principals in Lexington, Ken- 
in]! p OF Ashland School, J. 
a gc School, 8. Gabby of 
ieir s T. Dunn of Mary Todd 

Ery u cachers, and their pupils. We 
St the 2d Bu also, to May Briscoe and 
s ah 88 for help in various phases 
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The authors have previously tested 
hypotheses regarding antecedents of 
cohesiveness (James & Lott, 1964; 
Lott & Lott, 1960). ‘Theoretically 
derived predictions regarding the 
consequences of cohesiveness also fol- 
low from placement of the concept 
within a Hullian framework. In an 
earlier investigation, degree of group 
cohesiveness Was shown to be posi- 
tively related to both intragroup level 
of communication and the tendency 
of members to conform to à dominant 
group opinion (Lott & Lott, 1961). 
The present study was designed to 
test another predicted consequence of 
variations in cohesiveness. Since at- 
titudes are presumed to have drive 
value, the greater the degree of inter- 
member liking (group cohesiveness) , 


the higher should be the general drive 
dividual members. Since 


level in in er 
learning is ordinarily & positive func- 
tion of drive, the authors predict 
that better individual learning will 
high- than in low-cohe- 


take place in { А 
sive groups (other things being equal), 
or that individuals who are in the pres- 
ence of highly liked others will per- 
form better on learning tasks than 
individuals who are in the presence 
of less-liked others. 


Little previous research has been 


62 Атвевт J. Lorr лхо Bernice E. Lorr 


concerned with learning as a function 
of interpersonal attraction per se. 
Perkins (1950) and Rehage (1951) 
investigated the consequences of dif- 
ferent classroom “climates” for learn- 
ing. or problem solving, but in these 
studies manipulation of leadership 
(teaching) techniques constituted the 
independent variable. More relevant 
to the present study is the finding by 
Shaw and Shaw (1962) that early 
spelling scores were positively cor- 
related with the degree of liking 
among members of second-grade study 
groups.” 

In the present study we compared 
high- and low-cohesive groups of 
fourth- and fifth-grade children of 
similar tested general intelligence on 
a verbal learning task, retention, re- 
learning, and a second verbal learning 
task. The authors’ expectation that 
children in high-cohesive groups will 
learn better and retain more than 
children in low-cohesive groups pre- 
supposes that the children being com- 
pared are similar in general learning 
ability, and it was assumed that the 
latter is validly reflected by IQ scores. 
Specifically, the authors predicted 
that among children who are similar 
in tested intelligence, either high or 
low, those who are in small groups 
with other children whom they like a 
great deal will learn and retain new 
material better than those who are 
with children they like less. 

Since it was assumed that objects 


2A number of recent investigations have 
tested hypotheses which relate verbal con- 
ditioning to the relationship between S and 
E, or simply to the attractiveness of E 
(e.g, Binder, McConnell, & Sjoholm, 1957; 
Reece & Whitman, 1962; Sapolsky, 1960). 
Although these studies are theoretically 
relevant to the present hypothesis, discus- 
sion of them will be omitted since the 
focus here is on learning within a group 
context where the group is composed of 
individuals who are mutually attracted to 
one another to & high or low degree. 


or persons which evoke positive atti- 
tudes can serve as secondary rem- 
forcers (Hull, 1952; Lott & Lott, 
1961), that is, can strengthen be- 
havior when they are presented con- 
tingent upon that behavior, it was 
necessary to eliminate as far as possi- 
ble the chances of this occurring. Such 
direct intermember reinforcement 
would confound the effects of in- 
creased drive based on the mere pres- 
ence of liked group members. If the 
group members were permitted to 
freely interact, then the children 10 
high-cohesive groups could reinforce 
a variety of one another's behaviors; 
related or unrelated to the learning 
tasks. Consequently, the experimenta 
situations were carefully planned to 
control and channel member inter- 
aetion. Both verbal and nonverbal 
communication among members 0 
all groups was strictly structured an 
confined to task essentials. 


METHOD 


Subjects 


Four elementary schools, three predom® 
nantly white and one entirely Negro, 7 
student body, cooperated in this investi 
tion. Within each school, subjects (S8) ЧЫП 
obtained from one fourth- and one fift A 
grade class. For Learning Task I (LTD d 
total of 206 children (97 boys, 109 girls) bo 
selected on the basis of sociometric and A 
score considerations to participate in thre! 
or four-person same-sex groups. ed 

One week after LTI, the same S8 (reduc p 
in number to 155; 71 boys, 84 girls) Tae 
tested, within the same small groups, 00 nd 
tention (RET), Relearning (RED) 9, 
Learning Task II (LTII). S losses are ® 
counted for by absences: In the case ad 
three-person groups, the absence of one chi 
eliminated the other two; if one chil 
absent from a four-person group, howe o 
the group simply became а three-pel* 
group for the remaining tasks. 


eti 


Procedure 


а wo 
Experimental Formation of Groups- T Я 


4 D 
days prior to presentation of LTI, all me s 
bers of each of the eight participating € e 


GnouP COHESIVENESS AND LEARNING 


were giy P 
their pem. a sociometrie test (Socio I) by 
Friendship D teacher. This consisted of а 
all d in which each child rated 
the following E children in his class on 
——— I like him very, very 
Meu. hr гу, very much. 

e» him. He's alright, but I like 

me other boys better. 

— — Someti е hima li 

z imes I like him a little bit and 

metimes I don’t like him. 

at of the time I don’t like him. 
um him and don't think I 
, Each te ony 
me esi employed the same simple 
oard tol "We with the aid of a large card- 
demonstrated of a Friendship Book page, 
the опе iis how to place an X in front of 
an S felt eee which best described how 
*Ppeared "po the boy (or girl) whose name 
inga Were. th e top of the page. These rat- 
Ag from 5 Ct converted to numbers rang- 
0 ]. I like him very, very much.") 


— 
UE 


High. n 
(Loco priis (HiCo) and low-cohesive 
m dren Ha QN three or four 
cop tal rating formed on the basis of similar 
heat Slveness SE À single score, indicating the 
the Veness ind. vel of a group (its group-co- 
a e formul ex—GCI), was calculated by 
Tatts, and ү сс Qr c Th mem x= 
tap bere who N = the number of group 
mers into а rated each other. This formula 
Mber by ee the rating of each group 
Bes] os other member and is, thus, 
1901) lon of measure of our conceptual 
Tan )- Thirty-five hesiveness (Lott & Lott, 
and ДЕ in G ve HiCo groups were formed, 
27 ТСС from 50-42 (М = 4.59), 
groups were formed, ranging 
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in GCI from 33-22 (M = 2.91). Of the total 
of 62 groups, 37 had 3 members and 25 had 4. 
In addition to the sociometric ratings 
made and received, Ss’ IQ scores were also 
taken into account in placing them in groups. 
The California Short-Form Test of Mental 
Maturity, Elementary Form, 1957 (CTMM) 
had been administered several weeks prior 
to Socio I. A child with a total IQ score of 
116 or above was eligible for placement only 
in a high-IQ (Hil) group; 2 child with a 
score of 100 or below was eligible for place- 
ment only in & low-IQ (Lol) group. Those 
with scores between 115 and 101 could be 
placed in either Hil or Lol groups, but the 
authors tried to avoid putting an S into a 
Lol group if his IQ score was above 108 and 
putting an S into a Hil group if his score 
was below 108. It must be kept in mind that 
Ss had to be placed in groups with other 
IQ level as well as 


children of & compatible е 
with other children whom they rated highly 


and were highly rated by (HiCo groups), or 
whom they rated poorly and were poorly 
rated by (LoCo groups). | 
Table 1 describes the independent-variable 
conditions, Hil-HiCo, Hil-LoCo, LoI-HiCo, 
and Lol-LoCo groups in terms 0 
dex, total num- 


cohesiveness in! 
an individual IQ score. 


ps) were present for 
subsequent tasks 
(RET, REL, and LTID, the independent- 
variable conditions are also described for the 
latter. It is clear from Table 1 that S losses 
produced only minor changes in average CO- 


hesiveness and IQ-score characteristics. А 
, 


Learning Task I. The experimenter 


when introduce 


cher, 


TABLE 1 
SuMMARY OF [EXPERIMENTAL CONDITIONS 
манак No. groups Mean GCI Total No. 5s Mean IQ 
rni 
Hipp Task 
iL pio : 4.63 64 120.98 
REN 12 2.97 39 120.46 
Lorg co 12 ee ^ vien 
Tor Т i 49 96.80 
al 15 2.86 E. 
Sten gy 62 
" 
Hi ee _Relearnin d 
ni-Hi Thing Task IP an 
et Lote 4.62 47 121.21 
to Hie, io 2.93 28 120.96 
"Тосо E 4.53 4T 99.32 
Ча ° 10 2.91 E: 98.47 
49 
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ained that he wanted to find out how 
Е children сап learn words in a foreign 
language. The previously formed small 
groups were then called to the experimental 
room in a random, predetermined order and 
given identical task instructions. _ | 
LTI consisted of learning Spanish equiva- 
lents of English words. Ss first. worked as a 
group in arranging and assembling the mate- 
rials: 12 64 X 5% inch plastic-covered cards, 
each containing a four-word Spanish sen- 
tence (e.g., ^Este es un vaso.") and an appro- 
priate india-ink illustration, and 12 plastic- 
covered 5 X 2 inch cards, each containing a 
four-word English sentence, equivalent to 
one of the Spanish sentences (e.g., “This is a 
glass"). Each English card could be fitted 
onto a Spanish card by sliding it under a 
paper clip on top of the latter. Ss were told 
that they would each receive three or four 
cards (depending upon the group size) with a 
picture and Spanish sentence on each, and 
were instructed as follows: 
In the center of your table I will place 
a stack of smaller cards. These will not 
have pictures on them. Instead, each will 
have just an English sentence on | 
Your job will be to match up each 


English card... with one of the Spanish 


picture cards.... You can do the matching 
job best if each 


one of you, in order, will 
turn up an English card on the top of the 

Stack, read the sentence out loud, and then 

see who has the Spanish picture card that 

goes with it.... [Whoever] has the right 

Spanish picture card gets the matching 

English card [and]... can then put the 

two cards together... , 

When you have matched each Spanish 
picture card with the correct English one, 
let me know.... 

The amount of time it took each group to 
assemble the materials was recorded by an 
assistant. 

Following the group matching procedure, 
Ss were told that they would be tested on 
their knowledge of the English meanings of 
Spanish words after a period of studying the 
materials individually. They were instructed 
to: 

Study your own [cards] ... first. When 
Miss Briscoe rings this bell, pass your 
pairs to the...right. Study [the new cards 
you've received]...until the bell rings 
again. Then pass your cards again to the 
.e right. Well keep doing this until 
you've all had a chance to study all the 
...eards twice... . Study quietly.... 

The interval between bells was either 1% or 
2 minutes, depending upon whether there 
were four or three Ss in the group (and thus, 
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upon whether each S had three or four 
cards). 

After 12 minutes of study the cards T 
collected, and a mimeographed answer aer 
with 15 spaces on it was distributed to each 
S. Each Spanish test word was gronente 
once, one at a time on a printed card he 
in front of Ss for 15 seconds. Ss were told to 
write the English meaning or to draw à € 
in the appropriate space before going on 
the next test word. r 

Retention and Relearning. One week afte 
LTI, Е reported to the class that they ha 
done well on the previous learning task, aie 
he once more solicited cooperation for ай = 
tional language learning. The same group 
were again called to the experimental ци 
in a predetermined random order wher 
they were given instructions for a retentio® 
test on 10 of the 15 Spanish words on which 
they had been tested in LTI. On the reten” 
tion test each Spanish test word was dii 
posed on a card to the group for 10 secon i) 
E then turned the card over to expose * 
English equivalent for approximately e 
seconds, saying the English word aloud аз 
flipped over the card. . hree 

After the RET trial, Ss were given thro? 
relearning trials. The same words were Ps 
sented by the same technique, but in & E Ss 
randomly determined order on each trial- 
were instructed to stop if they got all it 
words correct on any trial and to sit poe 
the group while the others went on. A D ob 
answer booklet was distributed prior to €? 
trial. the 

Learning Task II. Immediately after 5" 
last REL trial, Ss were introduced 10 Ah 
other task, the learning of 10 new Sp 
nouns and their English equivalents (¢ 
"silla"-"chair"). Each S was given his Sile 
list of the 10 Spanish-English pairs, wo 
the group as a whole received a stack 0 "t 
cards, each with one of the Spanish wor FA 
it. E suggested the following study PI? 
dure: 

Starting with you...why don’t 
turn over the first card when I say 
gin.” Place it on the table so that eve 
опе can see it and all of you try tO 
on your list what the Spanish word ™ 
in English. Call out the correct TRE 
answer as soon as you know it. Then 
turns turning up a Spanish card from 
center pile. After a few minutes 
give you the same kind of test that 
gave you on the old words. k of 
After Ss had gone through the stat the 

cards four times, Е gave four trials 0 "ys 
new words, using the same procedur? s, 
that followed previously for RET and R 


you 
«pe^ 
ту 


ean? 
jish 
ake 
the 


I just 


] 
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Sociometri 
weeks es n Test Il. Approximately 2 
Bogner. ET, REL, and LTII, a second 
each class Mana (Socio II) was given in 
ss by the regular teacher, following 


the s 
„е Same ^ 
cio p. ^ Procedure as that described for So- 


RESULTS 


Lear ni k res 
ng Tas с Scores 


The me: 

our differen scores earned under the 
Were cal nt experimental conditions 
task culated for each learning 
presented in Figure 1. 
pw show mean scores 
ively 3 and low-IQ Ss, re- 
2ҮС'у, under HiCo and LoCo 
; Separately for each sex." 
m pae maximum possi- 
test um "id 5 correct words; the 
8 me Was 10 (I on all subsequent 
By b. seared aci and LTII, an 
k en the sa on any trial was 
адь у 6 80016 100 uy remain- 


eI Р 
pad & vi vatiable high versus low, 
of ing uen cant difference on every 

»witl except for the last trial 
than L Hil Ss scoring reliably 
of me D Ss in every case. The 
“ао в n analyses using 
00 88 follo obabilities (Siegel, 1956) 
Ret Rit, a ЈАЗ, a^ = ВА < 
GENERE E є д1; НЕ 
tial y Ж „= 1095, p < 001; RED, 
pial g X = 906, р < 01; REL, 
ki = 492, р < 05; ШП, 
g X, = 1096, p < 001; LTIL 
>“ 5.63, р < 02; LTII, 
T e 5.50, p < .02; ШП, 
justif 72, p < 10. Since these 
d Poa. moore treatment of 
Се for fr Q Ss, an analysis 0 
WO requency data (Wilson, 
he be Minar, 
ed ho of ed analyses indicated that the 
tes? Те ‘Ons in a group, three or four, 
à effect on the learning-task 


Ven ` во duris 
omp: cta from bot Á 
ed, oth types of groups 


m bin, 


65 


1956) was performed separately for 
Hil and Lol Ss, for each set of learn- 
mg scores, to test the effects of co- 
hesiveness and sex. This analysis 
compares conditions in terms of the 
number of individuals within each 
who scored at or above the median 
and the number who scored below the 
median. The results of these analyses 
are given in Table 2. 

Examination of Figure 1 will show 
that in the Hil condition HiCo Ss 
did better than LoCo Ss, as was pre- 
dicted, in every case except for LTII, 
Trial 1. The relationship between 
learning-task scores made by Hil Ss 
and cohesiveness is statistically re- 
liable on RET, and on the first two 
REL trials, and is close to the stand- 
ard level of significance оп the third 
REL trial, and on the third LTII 
trial (sce Table 2). 

The analyses of variance sum- 
marized in Table 2 indicate that 
among Hil Ss, sex is not reliably 


associated with the scores (except on 
h the girls did 


LTII, Trial 2, on whic 
significantly better), and there 18 
significant co- 


little indication of ) 
hesiveness-sex interactions. What is 
not revealed in Table 2, however, but 
can be clearly seen by inspecting 
Figure 2, is s in 
g between 


i ifferences 00 each learn- 
HiCo-LoCo difai joa tm 
2 (1.е., P < 105) was ob- 

a reliable relation- 

n ohes and learn- 
bri à ‚ nine measures: LTI 
(2 = 5.67); REL, 
2 — 392); REL, Trial 2 
(е = 5.24); LTII, Trial 3 (2 = 
5.67); and ШП, Trial 4 [3 = 5.67). 
In contrast, among the high-IQ boys, 
conditions O high and low cohesive- 
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Ета. 1. The experimental conditions (total sample) compared on the learning tasks. 


ness made no reliable difference on Turnin e 
А g now to the low-IQ Ss, V 
any learning measure.* find a strikingly different ip oí 


‘We may look at the same data in the sign test (Si e 
: egel t tb 
another way. On every one of the nine probability hu Ge Ele p ur 
learning measures, HiI-HiCo girls did better, 


1 d chance is 004 (two tailed i is not 
on the average, than HiI-LoCo girls. Using Strictly applicable шо з ай 
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SNFLMNDONLHABDONAHA TONSA ЖЕКЕ ЖЕЛЕ ЛЛ: 
Т 


ES 


Mean Scores 


w 


D 


Learning 
Task I 
Retention 
Relearning 
Trial 1 
Relearning 
Trial 2 


Ti " М 
92. HT ва compared, by cohesiveness condition 


Teg 

that i It can be seen in Figure 1 
Rast every one of the learning-task 
Ss i, 165 the average score of LoCo 
a meester than the average score 


tak 9 Ss, in direct opposition to 
е] 


Tort o 

va the ye same Ss, but it is suggestive. 
T boys, on the other hand, av- 
ve . Scores exceeded LoCo scores On 
ut of nine measures; the two- 


Xo Pri F 
Menon ie "y that this is a chance phe- 
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HiT Hi Co Girls A——4^ 
Boys @-—--® 
HiI LoCo Girls 4—45 


Boys 0--—0 


Learning Task II 
4 


Trial 
Trial 


Learning Task IL 

Trial 2 

Learning Task IL 
3 


Relearning 
Trial 3 
Leaming Task IL 
Trial 1 


and sex, on the learning tasks. 

the authors' předietion P Abs 
i on Hil юз. ela- 

results obtaine bep 


: : i 
ship, however, е 
Sek ui the learning measures is not 


is indicated in 

i > .05), as 18 indicate 
к; (P Dough significance is ap- 
Led on RET, REL Trial 2, and 

ials 1 and 2. 

gon analyses Were performed 
the scores made by boys and girls. 
Among the Lol girls, the average 


e 
[7] 


m о aa o£ 
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Mean Scores 


m 


Retention 
eleorning 


Relearning 
R 
Trial 3 


Trial 2 


Ar 
Fia. 3. LoI Ss compared, b 
LoCo score exceeds the 
score in every case exce 
trial of LTII, but non 
hesiveness-score relati 
tistically reliable (median tests). On 
REL Trial 2, a x? of 2.92 was ob- 
tained, which approaches significance 
(p < 10 > .05).5 For the Lol boys, 
the average LoCo score is greater 
than the average HiCo score in every 
case. Median test analyses of the 


average HiCo 
pt for the last 
e of these co- 
onships is sta- 


"Using the sign test, it is found that a 
pattern of eight results out of nine in one 
direction (LoCo > HiCo) has a two-tailed 
chance probability of .04, 


° The probability is .004 that nine out of 
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LoIHiCo Girls 4 E 
Boys g 

Lol LoCo Girls bo 
Boys O77 


H H H H 

е е ё е 
ew on 

g= Ж g ew 

BL. t. E. E 

55 52 55 55 

ES. = SE Se 


y cohesiveness condition and sex, on the learning tasks. 


data on each learning test yielded ? 
х? of 336 (p < 10 > 05) on REY 
and а xè of 687 (р < 01) on LTH 
Trial 2. All other chi.squares ® 
nonsignificant. 


Time Measure 


„Оһ LTI a record was kept of r 
time that it took for each group 
children to 


match the Spanish м 
tences to their English equivalen”; 
his was a group activity, prelimina” 
to individual study. 


he results are shown in Table 4 


nine results j E ates m hane? 
ne results in one direction is a € 
phenomenon (sign test). 


Group COHESIVENESS AND LEARNING 
69 


Sussa ше 2 
— ABLE 2 
omn SS ANALYSES OF VARIANCE (Е 
ES AD ов FREQUENCY d 
( REQUENCY Data) ON LEARNING-TASK SCORES 
Hil Ss 
Lol Ss 
Task 
x? for source of variance? x? for source of variance^ 
а Cohesiv i i 
hee И ohesiveness Sex Interaction Gohesiveness | Sex Interaction 
RET 2.47 2.57 3.34* eem = 
EL, Tri; 4.01** — = 2.79* 5.13 а 
Re rd 5.28** im | = | = 
REL arial 2 ‘ore Z = |) Z = 
LTT Trial 3 507 EE _ ond = = 
LTY Tal 1 id = 
» Trial 2 n" 55 03 a 55 К 
Vn ual 2 2.1 4.25 | 1.08 3.09" | 2.55 3.83 
Ler 48 3 3.54* - {| 2 
rial г 
ӘД, | яз | 106 2.85* | 1.30 2.46 2.25 
ию — For H | T 
1le—Por Hil Ss, N = 10: = 
'L, 80 on mede ies 103 on LTI, 75 on subsequent tasks; for LoI Ss, N = 103 on 
lity of obtaining a significant x? (deter- 


mi EI — 
lined by the mo, case. If there was 


t 
he Case 


Dis 
* 10 5 
»« 08 ^ .05. 


16 magnitude of the total x 


T 
iffe. 
теп 
Ces between means were eval- 


1949), ps the £ test (McNemar, 
DR Work 0 groups were found to 
Co gro ed significantly faster than 
ample thee within the total Hil 
одаје офа is, boy and girl groups 
vi Sim у and within the 
meen 5 of girls. The difference 
je оуз iCo and LoCo groups of 
L not at in the same direction, 
0 grou atistically reliable. For the 
5 the! the results are quite dif- 
wi lve are no reliable differ- 
i hi lh HiCo and LoCo groups 
Ca YS total sample, or for girls 
moe cage considered separately. In 
th, e эе ЕБ, the LoCo groups 
the he E y faster, on the average, 
8 üngin liCo groups, in reverse of 
qd is m for the HiI groups. 
this ti considerable interest that 
; i measure, as one woul 
CE NN Hil groups were faster 
LoI groups, on the average, 
on 4 conditions, but under 
ditions, Lol groups were 


no possibi 
3), further analyses were not per 


wherever x? б 5 ; 
ever x? figures are not given 1n the table. 


formed. This was 


faster than Hil groups. In neither 
case is the difference between the IQ 
conditions statistically reliable (t = 
1.80, df = 33; t = 1.23, df= 25—for 


TABLE 3 
py GROUPS TO Мхлтсн SPANISH 


Time TAKEN 
то ENGLISH SENTENCES IN LEARNING 
Task I 
Condition ears Миы, s Й 
аја MENS ee 

Total sample 
Hil-HiCo 18 | 112.78 19.1 | 9 og« 
Hil-LoCo 12 | 17.33 51.9 | ^ 
Lol-HiCo 17 | 135.93 48.5 _ 
Lol-LoCo 15 | 127.00 26.3 

Girls 
Hil-HiCo 10 | 111.00 19.3 | i5 
HiI-LoCo 8 | 134.12 i.p € 
Lol-HiCo g | 130.88 66.8 | _ 
Lol-LoCo g | 117.88 16.6 

Bal i1-HiCo g |115.00| 19.8 | 441 
Hil-LoCo 4 | 173.75 | 82.3 | ` 
LolI-HiCo 9 | 139.67 | 28.8} _ 
Lol-LoCo т | 137.43 | 32.5 


*р< .05 (two-tailed test). 
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TABLE 4 "m 
" IN VTERA! 
ч IN АуЕВАСЕ RATINGS or OTHER Group MEMBERS FOLLOWING INTER 
ан IN THE LEARNING Tasks 


i i 1 
Condition No. Ss. | Mean ratings | Mean ratings Ma sa 
i Жжж 
е ніСо 30 4.76 4.41 = 196 .08 p. c 
Hil-LoCo 16 2.97 4.00 1.03 .23 P 
LoI-HiCo 21 4.53 4.58 05 11 ghee 
LoI-LoCo 15 2.95 3.74 79 .99 2 
M аы, 18 4.38 4.56 .18 14 ae 
Hil-LoCo 15 2.84 3.35 .51 .99 as 
LoI-HiCo 22 4.53 4.80 —.03 .08 TÉ 
LoI-LoCo 17 2.95 3.40 .45 22 : 


* р < .10 > .05 (two-tailed test). 
* p < .05 (two-tailed test). 
*** p < .001 (two-tailed test). 


* 


HiCo and LoCo, respectively), but 
under the HiCo condition the two- 
tailed probability approaches signifi- 
cance (p < .10 > .05). 


Changes in Sociometric Ratings 


By comparing Ss’ ratings of their 
fellow group members on Socio II 
(subsequent to group interaction) 
With ratings on Socio I it could be 
determined whether any changes were 


related to the experimental condi- 
tions. 


The data shown in Table 4 are 
taken from those Ss who were present 
in their groups for RET, REL, and 
LTII and who were also present when 
Socio II was given. It can be seen 
that the children in LoCo groups 
rated fellow group members signifi- 
cantly more favorably after their 
group experience than before; this js 
true of Hil girls and boys, as well as 
Lol girls and boys. Children who had 
been in HiCo groups, on the other 
hand, tended not to change their 
ratings of one another (which were 
initially high), except in the case of 
Hil girls who rated each other on 
Socio II significantly lower than they 


e 
had on Socio I. Although the gm 
decrease is small, it is highly relia 


Discussion 


The results on all the learning pU 
considered together, indicate sh“ 
high-IQ children who were in Di 
cohesive groups did generally be jp 
than high-IQ children who To j 
low-cohesive groups. For low-IQ € = 
dren, however, cohesiveness, OT ag 
gree of interpersonal attraction saei 
group members, made no differen ot 
although there was a tendency „in 
such children to do slightly bettet. ye 
low-cohesive than in high-cohes, 
groups. The findings for high-IQ © sit 
dren thus confirm our hypothe 
while those for low-IQ children 
not and suggest an opposite tren "by 
similar phenomenon was reporte 51) 
Calvin, Hoffman, and Harden (19 di 
Trends in three experiments n 
cated that, for Ss of high intellige?" ; 
group problem solving is better } "m 
Permissive than in a traditio of 
group “climate,” whereas for spot 
average or below-average intellige 
a permissive climate is a handicap: 


ta 
a 
Post-hoc consideration of our 4 


Group COHESIVENESS AND LEARNING n 


ha 
у еч a possible explanation. 
under hi ipa predicted that children 
ein b eene conditions would 
cohesive ter than children under low- 
mer Er ae because the for- 
i. eee of liked 
vated э be more highly moti- 
attitudes have greater “drive.” Since 
rive S are presumed to have a 
highly component, individuals with 
Broup positive attitudes toward other 
soned Ге should, it was rea- 
evel EE a higher general drive 
indiviq er things being equal, than 
slightly "Is with more neutral or 
their fel] negative attitudes toward 
is true "E group members. While it 
etter t} лаб the higher the drive, the 
Where n learning for simple tasks 
ls e ternative responses are few, 
ask is P known that when a complex 
теа 0 огу high drive will 
Sbonses en incorrect competing Te- 
impedi as well as correct ones, thus 
ber q * the learning process (Far- 
he inn So: Taylor, 1956). 
omeno: ors suggest that this phe- 
Present e has implications for the 
d sania conditions. High- 
Ow-IQ ren were reliably superior to 
tests i children on all the learning 
elatiy ndicating that our tasks were 
than ү? 100ге simple for the former 
t i e latter. Among the former, 
associata aed drive (assumed to be 
Oups) ed with being in high-cohesive 
Perform should have improved test 
Serve ml while it should have 
ithin depress it among the latter. 
“Ohesiy n the high-IQ sample, the 
fe Puto ee conditions made less dif- 
Wh this the boys than to the girls. 
пес is should be the case is not 
Stro € clear. Perhaps girls have 
s Was. affiliation needs and more 
th e di ee ties than boys. Or, the 
ü e Sui Dal learning tasks which 
ally gi ral employed may be essen- 
rls’ tasks at the developmen- 


tal period represented by ages 10 and 
11. High-IQ girls may be especially 
at ease in such learning situations, 
in which case the heightened drive 
provided by the high-cohesive con- 
dition should serve to improve learn- 
ing for the girls, but not for the boys. 
Observations of boy and girl groups 
made by E and his assistant during 
the learning sessions support the view 
that the former were less at ease and 
more distractable. 

Within the low-IQ sample, the 
average scores of children in high- 
cohesive groups tended to be some- 
what lower than those of children in 
low-cohesive groups regardless of sex, 
although the differences appear to be 
greater for boys. This is congruent 
with our assumption that the learning 
tasks were more difficult (complex) 
for low-IQ children and for boys. 

The results obtained on a time 
measure in Learning Task I are in 
line with, and support, the data ob- 
tained on the learning tests. HiCo 
groups were significantly faster than 
within the high-IQ sam- 
ple; again, this was especially true of 
the girls. Within the low-1Q sample, 


however, rked 
faster than al- 
though not reliably. Thus, for the 
high-IQ Ss, the condition of working 
with liked others facilitated perform- 
ance in the sense of increasing task 


speed, while for the 
same condition seeme 
with performance. . 
The authors' findings 0n changes in 
sociometric ratings of fellow group 
members following interaction in the 
learning tasks are also interpretable, 
post hoc, within а learning-theory 
framework. In all conditions except 
hildren tended either to in- 


one the € i 
crease their ratings of other group 
members or to rate them the same as 


they had previously ; this is generally 
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expected following neutral or positive 
interaction. Increased favorability of 
ratings was statistically reliable for 
LoCo Ss, both boys and girls, and 
both Hil and Lol. Since HiCo Ss 
started with ratings of group members 
which were close to the maximum, the 
only possibilities following interac- 
tion were no change or decreased 
favorability. The former result was 
obtained for HiCo boys, both Hil 
and Lol, and for LoI-HiCo girls. 
Among Hil-HiCo girls, however, so- 
ciometric ratings decreased signifi- 
cantly. In attempting to understand 
this latter finding, it is important 
first of all to keep in mind that feed- 
back from the learning tasks was 
similar for all Ss. E told all Ss, 1 
week after LTI, that they had done 
well, but no evaluation was given 
after the other learning tasks. Ss did 
know each time they got a word right 
or wrong on REL and LTII, but they 
could not compare themselves with 
other children to determine relative 
Success or failure, The HiI-HiCo 
girls were Superior to all other ex- 
perimental conditions on every learn- 
ing measure, but how well they 
thought they were doing is not known. 
The fact that their ratings of fellow 
group members declined may be ex- 
plainable on the basis of effort, using 
the concept of reactive inhibition 
(Hull, 1951). It is reasonable to infer 
that the Hil-HiCo girls exerted more 
effort than children in the other ex- 
perimental conditions both from their 
superior performance on the learning 
tests and from the fact that they 
worked fastest on LTI where a time 
measure was obtained. It has been 
suggested (Lott, 1963) that perhaps 
stimuli (other group members) asso- 
ciated with effort come to evoke 
avoidance tendencies just as reward- 
associated stimuli come to signal ap- 
proach. 


In general, findings of the раи 
study сап be said to have provide 
qualified support for the authors e 
tial hypothesis and to have raised 
new questions of broader scope for 
subsequent investigation. 
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EFFECT OF INSTRUCTIONAL SET AND VARIETY OF 
INSTANCES ON CHILDREN’S LEARNING’ 


HARRIETT AMSTER 
Institute of Human Learning, University of California, Berkeley 


In experiments with children of approximately 5 and 10 yrs., learning 
under intentional set was found to be significantly superior to inci- 
dental learning in children of both age groups. Moreover, under inci- 
dental set older children showed a reliably greater gain due to practice 
than did the younger ones. Also, for the 10-year-olds, the presence of 
irrelevant cues inhibited learning and decreased improvement due to 
practice. No consistent effects were observed for the variety variable, 
but for both groups there was some trend for the small-variety condi- 
tion to foster learning when irrelevant features were not included in 


the stimulus. 


The present investigation considers 
developmental differences in concept 
formation and memory for concepts. 
Two contrasting types of responses 
which could underlie the learning of 
concepts have been inferred by vari- 
ous investigators (e.g, Inhelder & 
Piaget, 1958; Vinacke, 1952). Con- 
cept formation of the deductive type 
involves use of strategies which might 
entail deductive reasoning and syste- 
matic testing of hypotheses; concept 
formation of the intuitive or associa- 
tive type entails associative responses 
to the presented stimuli. The present 
study bears on the hypothesis that 
these types of concept learning may 
differ in prominence depending on the 
age of the children and on their in- 
structional set. 

Set-induced differences in the 
relative dominance of deductive and 
associative responding would be ex- 
pected on the basis of the descriptive 
research by Inhelder and Piaget 
(1958) who noted a marked contrast 
between the intuitive thinking of 
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auspices of the Institute of Human Learn- 
ing which was supported by a grant from 
the National Science Foundation. 


children aged 4-7 and the operational 
and logical thinking of young adults. 
Similarly, others (e.g., Osler & Traut- 
man, 1961) have found experimenta 
evidence in support of similar COP 
trasting processes of thinking occ” 
ring differentially as a function of age 
Instructional differences in d 
relative dominance of deductive а09 
associative responding would be ien 
pected on the basis of the рген 
Writer's earlier study (Podell, 1925. 
whieh suggested that for college gu 
dents deductive behavior occurs iJe 
ing intentional concept learning Ж 
associative responding predominat 
during incidental concept learning. 
the present study two instruction® 
sets were employed, an intention” 
set in which the subjects (Ss) jio 
instructed to deliberately form а C? е8 
cept of the similarities among а 8. 
of “flags,” and an unintentional set ye 
which they were instructed to 28 
aesthetic judgments about the cen 
series of stimuli. There is reason to P z 
lieve (Podell, 1958) that similar 77. 
structions can, at least among colle? 
Students, induce with different! e 
dominance, deductive and associat} Ж 
responding. It was further hypoth? 
sized that 10-year-olds would resp? e 
in a manner similar to the colle? 
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students. that i 
ido pnis is, would respond differ- 
tional заві ig and uninten- 
Hisce eret ack but that 5-year- 
А may not y t have learned 
deductive strategies et have learned 
assoeiatively E р would respond 
structions. ` nder both types of in- 

T s. 
GO ee differences in 
tional oe between inten- 
may be be я incidental instructions 
their iie by the observation of 
he degree iis a stimulus variable, 
from which d variety in the instances 
cording t 1 the concept is formed. Ac- 
arge us the present definition, i 
ber of differ w ould consist. of a num- 
‘nd the е in instances of a concept, 
Would eps ge small variety 
instances sist of an equal number of 
lere ap of the concept wherein 
thus ken fewer different instances; 
hui. instance is repeated more 
Отор кб ne latter than in the 
Ss emplo S On a logical basis, when 
arge Sn id deductive strategies, & 
Vantage тебу should provide an ad- 
le [Б л. a small variety because 
seg bh variety permits false hypoth- 
а small ph ag in fewer trials than 
етер ariety does (Podell, 1958). 
Vely ur when Ss respond associa- 
here red incidental instructions, 
v "e арр question as to the rela- 
ety d of the two degrees of 
Variety n the one hand, the large 
Summation Л, permit more efficient 
iU On the of correct associations, 
act that ү other hand, in view of the 
Peated m particular instances are re- 
Nemo More often in a small variety 
е ОТУ for con on к id 
sty facilitated cept instances shoulc 
ac ied th x by it. Podell (1958) 
dents , ese hypotheses in college 
long on found that under inten- 
5 ав € S the large variety 
аце т тоге effective in 
~ Small amg, S formation than 
ariety, However, when un- 


м 
e 


intentional instructions were em- 
ployed, the small variety was found 
to be more effective. It would appear 
that the superiority of the small va- 
riety under unintentional instructions 
stems from the dependence of unin- 
tentional concept learning on the abil- 
ity to reeall exemplars of the concept. 

The present experiments investi- 
gate in 5- and 10-year-old children 
the learning of concept instances un- 
der an intentional and an uninten- 
tional set and on ihe basis of large 
and small varieties of instances. Tt 
was expected that, when given inten- 
tional instructions, the older children 
to whom a large variety was given 
would excel over ihose given a small 
variety, but no corresponding differ- 
ence would be expected for the 
younger children. Furthermore, the 
older children should show better re- 
eall of instances on the basis of the 
intentional set than the unintentional 
set, but for the younger children, any 
differences in recall as à function of 


set should be smaller. 


EXPERIMENT 1 


Method 
Subjects. The experimental procedures 
were administered to eight intact classes 
) Unified School 


fornia 
kindergarten children 
th-grade children who 
ere postexperi- 


in the Berkeley (Cali 
District. Sis ty-four 


and 64 fourth- and fif 
n test W 


pas d a color vision t [ 
mentally selected аз 58. The Science Re- 
(SRA) Primary Mental 


ssociates 


administered to 
The mean me age (MA) of 
6 years, 4 months, and 
was 4 years, 10 months to 7 

months. The mean MA of 
»-grade Ss was 12 years, and 
2 months to 14 years, 
al age (CA) of 
ranged from 4 у 11 
2 months, and th CA of 
ranged from 9 
9 months. 


arten Ss was 


years, 
fourth- and fft 
the range Was 9 years, с 
The chronologic 


10 months. 

kindergarten Ss 
months to 7 yea 
fourth- ant 
years, 4 months 
` Experimental 
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there were four major experimental sub- 
groups, corresponding to the 2 X 2 design 
by which two levels of varicty of concept 
instances and two types of instructional set 
were manipulated factorially. In addition, 
within each subgroup Ss were presented 16 
different booklet types. There were 16 dif- 
ferent Ss within each major subgroup at 
each age level, and cach S received a 
different booklet; thus confounding the two 
sources of error. During the initial testing, 
many more than the 64 Ss at cach age level 
were assigned randomly to conditions and 
tested for color perception, An equal num- 
ber of Ss from cach subgroup were post- 
experimentally selected at random from 
among those who passed this test and to 
whom the experimental conditions had been 
administered. 

Materials. First to be considered is infor- 
mation relevant to the stimuli presented to 
an individual S. Each S received a booklet 
containing eight pages of flags, a blank 
page, two response pages, and envelopes 
containing materials for testing learning, 
There was only one flag on each of the eight 
pages. For this experiment, each flag con- 
tained four geometric shapes (circle, square, 
triangle, and star), each in a different color 
(red, green, yellow, and blue), and each 
placed in the center of a quadrant, The 
flags shown in Figures 1 and 2 each contain, 
їп addition, three irrelevant forms in the 
center column. If these three s were 
deleted, Figures 1 and 2 w ЫБ of 
sample booklets for Experiment I. Ignoring 
the three irrelevant, forms, Figures 1 and 2 
each contain the eight flags for one S. Fac- 
tors which were constant across pages within 
a booklet include the color of cach shape 
(eg, а square may always have been red) 
and the pattern on each quadrant (e.g., ver- 
tical stripes may always have appeared on 
the upper right quadrant). In addition, other 
constant factors varied among shapes, For 
the shape at Complexity Level 1, both po- 
sition and size remained constant; for the 
one at Complexity Level 2, position re- 
mained constant but size varied; for the 
one at Complexity Level 3, position alter- 
nated between two quadrants, but size re- 
mained constant; for the one at Complexity 
evel 4, both position and size were varied, 

The pages of the booklets were con- 
structed from separate parts—sheets of 
paper approximately 9 X 11 inches which 
contained a flag outline, four labels in the 
size of the quadrants of the flag which had 
different. background patterns, cight shapes 
(he four shapes each in two sizes) which 


were die cut from four colors of gum paper 
The flags were approximately 7⁄4 X 572 
inches. The diameter of the large shapes 
was approximately % inch. К 

Each S was given three envelopes which 
contained the following materinls for test- 
ing learning: crayons, cach of the four 
shapes in both sizes cut from white gummed 
paper, and four labels printed with the ap- 
propriate background designs. Each desk 
was set up before Ss arrived with a booklet 
and four envelopes each closed with a rubber 
band. 

Turning next to information relevant to 
the entire set of stimuli, across the entire 
set of booklets it is noted that each of the 
four shapes occurred in two sizes and four 
colors. The particular assignment of shapes 
to position, color to shape, and shape to 
complexity level was made with the aid of 
a Greco-Latin square, On this basis, 16 dif- 
ferent object, a üignments were made. The 
background labels were placed on the quad- 
rants in one of two spatial arrangements: 
Half of the object assignments employe 
one of the background arrangements anc 
half employed the other. A pair of book- 
lets which differed in the variety of dif- 
ferent instances was constructed for each 
object assignment. One was a small-variety 
booklet, (Figure. 1, omitting the three 3 
relevant forms) in which there was more 
frequent repetition of the same page than 
їп the large-variety member of the pail 
(Figure 2, omitting the three irrelevant 
forms). In all, there were 16 different large- 
variety booklets and 16 different small-vari- 
cty booklets which corresponded to the 
large-variety booklets in respect to the par- 
ticular assignment of color, form, position 
complexity level, and background. f 

. The arge-variety booklets consisted 0 
cight diferent instances. For each object 28 
Slgnment there were eight possible ways 0 
making a page, and the pages of the large- 
mocly booklets corresponded to thee 

hus, each page was different from every 
other page, as shown in Figure 2. For ihe 
small variety, two different instances Were 
selected from the eight, and each of thes? 
Was repeated four times. The selection © 
the two pages Was random with the restric- 
tion that the two reflected the alternative? 
required for complexity levels 2, 3, and, 

Procedure. Tn general, the procedure 10" 
volved a study trial in which S observet 
ihe flags by turning the pages of his OW 
booklet, constructed his own flag, observet 
the same flags in the same way for a secon" 
trial, and, finally, constructed a second fag 
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The experi 
sets of ee (E) read one of two 
n the Mi ieu and paraphrased them. 
Were eld dios for the intentional set, 
things whia that they were to “notice the 
see,” while ? are in all the flags they will 
dental en m the instruetions for the inci- 
you m £ | were told to “notice how well 
to turn Bir 1 one.” Ss were not permitted 
m the fidi until instructed. to do so as 
Onds were nistration of the SRA. Ten sec- 
before fic given for viewing each page 
Was EAM rington to “turn the page” 
Pages Mr A here was no interval between 
rni ter than the 1 second allowed for 
Minded to драва. were continually re- 
okle,” TI еер your eyes on your own 
а blank ds A ninth page of the booklet was 
he vigas ым. Immediately following 
following A Ss were asked to perform the 
malt white os in sequence: (а) open the 
Ше forms nvelope, (b) remove the eight 
ut on the which it contained, (c) lay them 
CACHE pa page, (d) open the small 
) color t} and remove the crayons, and 
le white forms the same color 


nt, i, pr шн. After the ex- 
M ers forms identical to those 
` Ppe ared on the flags were ds- 
ch тиске kindergarten pupils. They 
The orm with to color an area beneath 
Biver Ourth- та crayon of the same hue. 
ou simi] td fifth-grade pupils were 
8 ang , task and were also shown 
pila Ded to write their color 

making more than one error 


1 Et йз constructed by Ss were 
= d scoring S ependent scorers using a 
DS in d Sheet A third judge made 
R tly Such е cases on which scorers 
Ores "mon instanees oceurred predomi- 

меп Ss. Seven cate- 


[Eod kinder, 

i Ге score 

“hag Possible cored with respect to each of 
Dla cd: ^ parts of the flag. These in- 


Чо " 
"eq 1e m ^ 
8 imber of forms which were 


Ywhere Я 
cre on the flag—maximum 


correct was four different shapes on one 
flag—; the number of correct colors placed 
on the forms—maximum was one form 
colored each of the four colors—; the cor- 
reetness of the assignment of color to form 

газ four forms colored the four 
regardless of whether they 
correct position—; the cor- 
reetness of the placement of the forms on 
the background regardless of quadrant ; the 
correctness of placement of the forms on a 
the correctness of the assignment 
of color to background regardless of whether 
the form or quadrant was correct; the cor- 
signment of color to quad- 
of whether form or back- 
maximum total 


cok correctly 


appeared in the 


quadrant ; 


rectness of the 
rant regardles: 
ground was correct. The 
score was 28 points. 


RESULTS 
ariance and covari- 
ance were applied separately to the 
learning scores for Trials 1 and 2. The 
covariate, MA, was not found to cor- 
relate significantly with the learning 
17 for Trial 1, 7 = 22 
for Trial 2). The F ratio for regres- 
sion was 3.49 for the analysis of co- 
variance for Trial 1, and 4.23 for 
Trial 2 (p € -10, P < .05, respec- 
tively). The analyses of covariance 
showed only one statistically signifi- 
eant effect—instructional set. Аз ex- 
peeted, intentional learning was su- 
perior to incidental learning for both 
Trial 1 (F = 5.27, df = 1/95, P < 
05) and Trial 2 (F = 6.80, df = 1/95, 
p < 05. Table 1 presents the mean 


der the two sets as a func- 
level of Ss Al- 


Analyses of v 


scores (r — 


scores ul d 
ton, of the grade =. : 
though the effect of Variety did not 
€ ficance, the interaction 


[ variety showed a 

of Booklet Set and Variety 

trend (F = 2.50, df = 3/95, p < -10). 
the trend 


For booklet types showing 
at this level of significance, the Small 


Variety condition was more facilitat- 
ing than the Large Variety. Although 
there were no significant interactions 
between trade and Set, these data, 
as shown in Table 1, do suggest that 
improvement from Trial 1 to Trial 2 


approach signi 
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TABLE 1 
Scores OF Е Two AGE 
OF 

ny TRIAL 


Mean LEARN 


Experiment T 


Trial 1 


Sets and conditions 


M SD M SD 


Intentional set 
Kindergarten | 12.31 4.27 
4th &Sthyrade 20.06 6.12 

Incidental set 
Kindergarten | 11.47 5.04 
4th Sth grade) 16.16) 4.93 

Large variety 
Kindergarten | 11.88 4 
4th &5thgrade 17.16 

Small variety | 
Kindergarten | 11.91 
3th &5th grade, 19.06 


11.90 4.60 
21.47, 5.02 


did depend upon Grade and Set in 
that kindergarten Ss showed less im- 
provement with practice than did the 
fourth- and fifth-grade Ss, Analyses 
of variance and covariance of the 
difference in performance between 

Trials 1 and 2 were also carried out, 
and the interaction between Set and 
Grade was significant (F = 5.47, df = 
1/95, p < .05). n 

Analyses of variance and covari- 
ance were computed using another 
score—the number of times color was 
correctly assigned to forms, The total 
possible correct for the two trials was 
eight. As for the other measure, Set 
was the only variable which showed 
a statistically significant effeet (F = 
7.01, df = 1/95, p < 01). The mean 
scores for the kindergarten Ss were 
2.09 under incidental learning 
3.34 under intentional learning, 
for the fourth- and fifth-grade 
5.03 under incidental learning 
6.22 under intentional learning. 

A Friedman two-way analysis of 
variance was carried out to test the 
hypothesis that the number of correct 
assignments of color to 


and 
and 

Ss, 
and 


forms de- 


creased with complexity level. In- 
stead of performing this analysis for 
individuals, the four individuals re- 
ceiving one type of booklet were 
treated as one individual. Scores were 
assigned for correct color on form for 
each complexity level based on the 
number of persons who assigned. the 
color correctly; these could vary from 
0-4 correct. Separate analyses were 
computed for the two trials and for 
the kindergarten and. fourth and fifth 
grades. The X, values ranged from 
-30-.80 and did not approach statis- 
tical significance. 


Experiment II 

No evidence for the hypothesized 
interaction between Instructional Set 
and Variety of Instances was 0b- 
tained in Experiment I, and an analy- 
sis of the way in which Variety w2* 
manipulated suggested that the effect 
might be observed if a different moc 
of explicating this variable were ет" 
ployed. The definition of variety 3 
ready employed was the number 9 
different instances of the concept i 
be formed in which the instances only 
contained features relevant to the 
concept. By contrast, Experiment, 
included, in addition, a new definition 
of variety—the number of differen" 
irrelevant features within the entire 
set of exemplars from which an 5 
formed the concept. The instance? "i 
Experiment I contained only relevan’ 
features, the instances in Experimen 
II contained both relevant and irele” 
vant features, and the variety of xe 
was manipulated independently by 
means of a factorial design. 


Method 


Subjects. The experimental 
were administered to eight intact 
and fifth-grade class s which were rouse’ = 
comparable in all objective respects to tho 


cs 
procedure Р 
fourth 
ghi 
di 


; ^. E - wis 
used in Experiment I. A total of 61 55 V 8 


included, distributed among 8 groups oup 
Ss cach. The mean MA of the total 2" 
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was Р 
poet p 9 months, and they ranged 
months, ‘The í 0 months to 15 years, 7 
months t ne CAs ranged from 9 years, 3 
Maines, П years, 5 months. 
were identics Booklets were prepared which 
» except = to those used in Experiment 
\гее irr zm as shown in Figures 1 and 2 
Center dr evant forms were added to the 
teen [Pe ena page of each booklet. Six- 
in Сы мны were selected from those used 
variety bo ae I of which half were large- 
Tespondin, oklets and half were the cor- 
booklets те small-variety booklets, and the 
atin shied presented two of the four Greco- 
he 16 han employed in the earlier study. 
32 in all s ir were duplicated, making 
dded is ree irrelevant color forms were 
either a PE page such that there was 
orms, One E or a small variety in these 
Biven a s of the two identical booklets was 
and the us all variety of irrelevant. forms, 
edem s was given a large variety. 
lows: The evant forms were selected as fol- 
Of the Ee pool of possible forms consisted 
Color, T} Shapes in each size and in each 
Which ihe yielded 32 possible forms from 
Ma ive 6 forms used as the relevant forms 
Televant ү booklet were excluded. The ir- 
Maining ойы selected from the re- 
Were forms. For the large variety, sets 
Cach pa, selected. without replacement. for 
9f the 8. Thus each booklet require 
Variety A ERA For the corresponding small 
Were Pat aera forms 2 of the sets of 
repeated ected at random, and each was 
there wore pt pages of the booklet. Thus 
Small M 32 booklets of 4 types: large or 
forms, aaa with respect to the relevant 
Spect ‘tg large or small variety with re- 
„тоо е irrelevant forms. 
ered in ep eae II was adminis- 
criment, ctly the same way as the first. 


RESULTS 


Analy 
formes Ses of covariance were per- 
ав the on learning scores with MA 
analyse control variable. Separate 
i bres for Trials 1 and 2 and the 
Te sn Scores were computed, but 
тедеп no reliable effects. Table 2 
the rst means of the main effects for 
Sig and second trials. An analy- 
Вен, a ee mixed design, was 
Тош € in which the results of the 
Were nd fifth grade for Experiment 
combined with those of Ex- 


le 


TABLE 2 
MEAN LEARNING SCORES FOR 
MAJOR VARIABLES 


Experiment II 


Trial2| SD 


Sets and conditions | Trial 1| SD 
Irrelevant va- 
riety 
Small 14.69) 4.79 | 16.03) 5.13 
Large 14.99] 4.86 | 15.06) 4.99 
Variety 
Small 13.84] 4.22 | 14.66) 5.06 
Large 15.06] 5.30 | 16.44) 4.94 
Set 
Intentional 15.31) 4.26 | 16.03) 5.41 
Incidental 13.59] 5.19 | 15.06) 4.68 
periment II. The variables of this 


luded Presence of Irrele- 
which for Experiment 
Irrelevant Forms 


analysis inc. 
vant Forms, 
I was Absence of 
and for Experiment II was Presence 


of Irrelevant Forms. In addition, Va- 
riety, Set, and Trials were employed 
as in the separate analyses. It reveals 
that intentional learning was signifi- 
cantly better than incidental learning 
(F = 833, df = 1/120, p < .01), that 
learning was greater in the absence oi 
irrelevant stimuli than in its presence 
(F = 48.00, df = 1/120, p < 001), 
and that improvement occurred from 
trial to trial (F — 29.36, df — 1/120, 
ddition, the interaction 


p< 01). Ina 
between Presence of Irrelevant Forms 
and Trials (F = 10.12, df = 1/120, 


cates that the improve- 
ment from trial to trial was greater in 
the absence of irrelevant forms than in 
their presence. In the presence of ir- 
relevant forms the mean for Trial 1 
was 14.45, and for Trial 2, 15.55. In the 
absence of jrrelevant forms the mean 
for Trial 1 was 18.28, and for Trial 2, 
22.48. The trend of the means for Va- 
riety Was opposite for the two experi- 
ments. However, the interaction be- 
tween the Presence of Trrelevant Forms 
and Variety did not reach significance 
(F = 2.60, df = 1/20, p > 05). 


< 01) indi 
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ExPERIMENT III 


The effect of the variety of the ir- 
relevant forms was studied in a kin- 
dergarten sample without the simul- 
taneous manipulation of the variety of 
the relevant forms. 


Method 


ubjects. Kindergarten pupils from two 
m the Berkeley Unified School Dis- 
trict served as Ss. The 32 pupils preexperi- 
mentally selected as Ss ranged in age from 
4 years, 11 months to 5 years, 11 months. 
The mean MA was 6 years, 4 months, with 
a range of 5 years, 5 months to 7 years, 4 
months. Ss were tested with the SRA intel- 
ligence scale and were assigned in advance 
to the subgroups in the experiment on this 
basis. The subgroups, numbering 8 Ss each, 
were roughly matched in mean and y 
ance for MA. 

Materials. There were 16 different book- 
lets, of which 8 contained a large variety of 
irrelevant forms, and 8 had a small varicty 
of irrelevant forms, Variety of relevant forms 
was not manipulated, Eight booklets were 
selected from among those used in Experi- 


ment I. One page which contained four large 
forms was sele eight 


ected from each of the 

booklets, New booklets were constructed by 
adding three forms to the one page which 
was repeated eight timos for the e 
There were 16 different booklets in all, 8 
large-variety and 8 corresponding small- 
variety booklets. Three irrelevant forms were 
added to each page as described above for 
Experiment IT. A smal] variety consisted of 
two instances from a large variety, each ro- 
peated four times. 


ari- 


ight pages. 


RESULTS 


Analyses of variance were com- 
puted for the learning scores of Trials 


TABLE 3 
SUMMARY OF MEAN LEARNING 


Ѕсовеѕ 


Experiment ПІ 


Sets and conditions | Trial 1 SD |Trial2] sp 
Variety 
Small 7.81 56 | 8.13 | 3.20 
Large 8.81 | 1.56 | 7.63 | 1.96 
Set 
Intentional 8.94 | 2.02 | 8.13 | 2.66 
Incidental 7.69 | 2.15 | 7.63 | 2.66 


1 and 2 and for the difference score, 
but no reliable effects were observed. 
The main effects are shown in ТИ 
3. Comparison of the scores obtained 
by kindergarten children in Experi- 
ment I with those obtained in “a 
periment III shows that a decline 0 
about 50% was brought about by the 
introduction of irrelevant forms de- 
spite the fact that there was 29 
variation in the relevant forms for Ex- 
periment IT. If irrelevant forms. ben 
not present, the task for Experiment 
III would have been easier than = 
Experiment I; thus, the magnitude 
the depression of performance broug® 
about by irrelevant stimuli is, in re- 
ality, greater than the observed value. 


Discusston 


As expected, intentional learning 
was found to be generally more peo 
tive than incidental learning. The ioe 
vantage of intentional learning me 
hypothesized to be greater for diis 
Ss than for younger ones, but no eve 
dence in support of this hypothesi 
Was obtained in Experiment I. How" 
ever, these data do reveal that Dd 
dergarten Ss improved with practic 
under intentional set to a greater Ti 
gree than under incidental set. ғ 
contrast, the older Ss showed greater 
improvement under incidental $ê 
than under intentional set, The а 
of the means in Experiment II зе 
Consistent with these results d 
fourth- and fifth-grade Ss, Qu! 
Possibly, under incidental learning: 
being tested induces a change in th 
set of the older Ss; for example, ui 
alerts them to the need to engage Hs 
intentional learning while younger ^7 
may continue to respond on an inc! 
dental basis, On the other hand, T 
10-vear-olds continue to learn incr’ 
dentally, the effect of practice on л 
cidental learning is greater for це 
than for younger Ss. Also indire 
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oe was obtained that the pres- 
7и irrelevant forms depresses 
effect " and the magnitude of this 
than ^s as greater for kindergarten Ss 
А г older ones. 
eu answer concerning the 
the = З ariable was not afforded by 
Was pee experiments. While there 
Willen s for the small variety to 
exemplar concept learning when the 
eatures Ts contained no irrelevant 
manifes? this effect was not at all 
similar Ф. р they were present. А 
ion of Tend, toward greater facilita- 
as been oe by a small variety 
erent ire for a somewhat dif- 
1965) pa (Amster & Marascuilo, 
curred owever, this effect has oc- 
ut has under special circumstances, 
eral stu. m been consistent over sev- 
bal co ies of the acquisition of ver- 
1963) ee (Amster, 1965; Podell, 
for the n which the large variety has 
tive, most part been more facilita- 
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REACTIVE INHIBITION AS RELATED TO 
PERFORMANCE ON PROGRAMED 
LEARNING MATERIALS 


LOWELL SCHOER 
College of Education, University of Iowa 


canceling task 36 Ss who 
and 36 who generated it 


interactions amo 
significant. 


performance. None of the 
ocabulary, and item type were 


es 


Several studies of the role of re- 
active inhibition (RI) 
learning have resulted 
(1952, 1957) 
construct 


in human 
from Eysenck’s 
incorporation of this 
І into his theory of per- 
sonality, 'The hypothesis that sub- 
Jects (Ss) who generate RI quickly 
will not learn as efficiently as those 
who do not has been supported in 
studies using classical conditioning 
(Franks, 1957), verbal conditioning 
(Eysenck, 1959), and reading achieve- 
ment (Lynn, 1960; Otto, 1963). 


The present study was designed to 
determine whether 


this hypothesis 
would be supported when the learning 
task involved programed materials. 


The degree to which intelli 
test-item type might influence the 
effect of susceptibility to RI as re- 
lated to performance on this type of 
material was also investigated. 


gence and 


METHOD 
Subjects 


Ss were 72 students enrolled in an under- 
graduate course in educational psychology. 
"These 72 students were chosen from a class 
of 107, on the basis of performance on a 
number-canceling task which was used as 
a measure of RI. Each student was given 
two 8% X 11-inch sheets of paper with rows 
of random numbers on them and was asked 
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to “x” over all the 5’s. Total time given era 
3 minutes, with Ss notified cach 30 secon 
to make a heavy line so that scores per nd 
Seconds could be determined. At the wate 
of the 3 minutes, all Ss took a Ta a 
break and then canceled for an additio]. 
30 seconds. The total number of ene 
inspected during each of the 30-second pi 
tervals was then determined for each 8, p" 
the difference between performance ON just 
first 30 seconds and the 30 seconds os 
prior to the break was determined as just 
the difference between the 30 seconds jte 
prior to the break and the 30 seconds 2 re 
the break. The difference between the 8—0 
9n the first 30 seconds and the 30 pour 
just prior to the break was used as the 
measure of RI. The difference between the 
scores On the 30 seconds just prior tO ak 
break and the 30 seconds just after the cn 
Was used to determine whether the task Es 
Ployed to measure RI would show incre" > 
scores after a rest interval, and thus progre 
evidence for the validity of the proce ibi- 
used to establish the high- and low-inhi 
tion groups. ps 
The 107 Ss were divided into two Era 
On the basis of their score on a vocabu m 
test, with those above the class median ү 
In the high the 
below, in th 
n order 
motivation ti 
this effect o 


“vocabulary group ап 
e low-vocabulary group. in 
to control for differences o 
о cancel numbers and re¢ 

; n the results, the placeme? po 
53 In an inhibition group within each | sty 
cabulary Sroup was done in two steps. Ж 
8s within each group were ordered OD | 


А 5 ;ng the 
basis of their canceling scores durin£ 
first 30 seconds, 
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NS k 
- a ie Ss were then matched as closely 
the fret А оп the basis of their score during 
pir with 4 seconds. The member of the 
Pecan the smaller drop from the first 30 
reak pis the 30 seconds just prior to the 
lie EE in the low-inhibition group. 
Signed to P with the larger drop was as- 
of proble Ше high-inhibition group. Because 
Sidene ms in matching, only 72 of the 107 
his | in the class could be used. 
BA fadum Ікеа in four groups of 
(V-H h-voeabulary high-inhibition group 
group p) low-vocabulary low-inhibition 
hibition Ч -LI), a high-vocabulary low-in- 
ary hi рр (HV-LD, and а low-vocabu- 
Y Migh-inhibition group (LV-HI). 


А! , 
Taterials and Procedure 


T В 

erpen earning material employed in the 
Mately тә was a programed unit of approxi- 
Statistics СЕ frames оп measurement and 
Prepared, | T material was a linear program 
Opies su fy the present writer and covered 

variabilis measures of central tendency 
earession, ility, reliability, validity, norms, 

s in ui and predietion. It was presented 
Mately sn xtbook form. Each S did approxi- 
Was not frames per day for 15 days and 
Outside permitted to study the material 
fact and а The students’ errors on 36 
items a 36 application-type multiple-choice 
erion 5 the end of the unit were the cri- 

Scores for the experiment. 


Resuurs anp 018008810 


ne Type I designs (Lindquist, 
the "in 267) were used to determine 
establi ty of the procedure used to 
S Oups h the high- and low-inhibition 
Scores | Table 1 shows the mean 
Ss on Fa the high- and low-inhibition 
Secon, he first 30 seconds and the 30 

S just prior to the break. The 


Mean N TABLE 1 
Low. hoe ІхѕрЕСТЕр BY HIGH- 
Seconps 10% Ss DURING THE First 30 
AND 30 Seconps PRIOR TO 

THE BREAK 


AND 


s: 
: First 30 sec. | Last 30 sec- 


Hi 

ыы: 

Loy intibition 129.6 | 
ibition 128.8 117.6 


TABLE 2 
Mean NUMERALS IxsPECTED BY HIGH- AND 
Low-ĪNHIBITION Ss DURING THE INTER- 
VAL IMMEDIATELY PRECEDING AND 
THAT IMMEDIATELY FOLLOWING 
THE Rest PERIOD 


Ss Preceding Following 
High-inhibition 99.0 110.8 
117.6 128.2 


Low-inhibition 


main effect for interval was signifi- 
cant at well beyond the .01 level 
(F = 103.59, df = 1/70). The inter- 
action between inhibition group and 
interval was also significant at well 
beyond the .01 level (F = 22.09, df = 


1/70). This suggests that there was 4 


significant drop in performance across 


the 3-minute period, and that the 
drop tended to be different for the 


high- and low-inhibition Ss. 


TABLE 3 
Mean Error Scores OF Hranu- AND Low- 
INHIBITION Ss ON 36 Fact AND 36 APPLI- 
cation-TyPE ITEMS BY VocaBULARY 
LEVEL 


High vocabulary Low Vocabulary 


А Appli- Appli- 

Fact | соп | Fact | cation 

igh-inhibiti 4 10.22 

High-inhibition 3.94 | 8.61 | 4.44 2 

Low-inhibition 9 | 9.33 6.06 | 11.61 
Note.—N = 18 per cell. 


RI dissipates during à rest period, 
so a test of the validity of the pro- 
cedure used to set up the two groups 
of Ss was 2 comparison of perform- 
ance during the interval immediately 

the interval immediately 


before and 3 
4 he rest period. 'The mean 


following t 
scores of the two groups of Ss during 


these two intervals are shown in Table 
2. The difference between the inter- 
vals is significant at well beyond the 
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.01 level (F = 43.85, df = 1/70). It 
should perhaps be noted that reminis- 
cence did not differ for the two groups, 
being 11.8 for the high group and 10.6 
for the low group. This could be in- 
terpreted to suggest that the rate of 
dissipation is independent of amount 
of RI accumulated, but no direct test 
of this hypothesis was attempted. 
This finding is, however, contrary to 
what would be expected from RI 
theory. 

A Type VI design (Lindquist, 
1953, p. 292) was used to analyze 
the error scores on the test taken by 
Ss at the end of the unit. Vocabulary 
was treated as a between factor. 
Item type and inhibition group (be- 
cause of the matching involved) were 
treated as within factors. The mean 
error scores for the various groups 
are given in Table 3. The F for 
vocabulary was not significant, nor 
were the Fs for any of the interac- 
tions involving vocabulary. The F 
for item type was significant at well 
beyond the .01 level (F = 183.81, 
df = 1/34), but none of the interac. 
tions involving item type were signifi- 
cant. The F for inhibition group was 
significant at greater than the .025 
level (F = 6.25, df = 1/34). Inspec- 
tion of Table 3 indicates that it was 
the high-inhibition group, not the low- 
inhibition group, which made the 
fewest errors, and this was true re- 
gardless of item type or vocabulary 
level involved. This effect is, of course, 
the exact reverse of that found in 
earlier studies using other types of 
learning tasks. This finding, plus the 
nonsignificant effect of vocabulary- 
test score, suggests that when pro- 
graming materials are involved, the 
influence of certain variables, such 
as intelligence, that have a significant 
impact on performance when other 
methods of instruction are employed, 
may not have a significant impact 


when programed materials are used. 
It further suggests that certain vari- 
ables such as RI which have not 
been extensively investigated in rela- 
tion to classroom performance may 
have a significant impact when pro- 
gramed materials are employed. 

No very ready explanation for the 
direction of the effect of RI in the 
present experiment suggests itself. It 
had been anticipated that because 
programed learning presumably in- 
volves operant conditioning, students 
who generate RI quickly would do 
less well than those who generate it 
slowly. The results of the present 
study when compared with earlier 
studies might suggest that the gener- 
ation of RI as measured in the present 
experiment is related to a [бео 
characteristic that has very differen? 
effects as type of learning materia 
shifts. In this way it may operate 
somewhat like anxiety which mar 
well facilitate learning of a simple 
task, but has a negative effect ОП 
learning of a difficult task. 
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To investigate the relationship be 


and changes in stereotypic beliefs, 


а Гами? of instruments was administered 
en 4 years later whether or not they 
dless о: 


principal findings were that re 
gar 
tendance (a) all groups became 


dogmatic, and with few exceptions, more “oul 
rientation, and (b) there was, for all practic 
academic aptitu 
Female Ss, 


relationship between gencral 


or direction of personality changes. 
underwent a more marked change in 


at college, 


values between 1958 and 1962 than did their 


potheses were advanced to explain some o: 
dixi at 


In . 
1963) ea previous paper (Lehmann, 
Bye ea AB demonstrated that there 


ШШ, changes in the critical 
y а d ability, attitudes of stereo- 
Value opi dogmatism, and traditional 
Ctiveen py of college students 
ears, Tt heir freshman and senior 
Some eh Was also seen that, although 
the 4 ye ange occurred during each of 
took plese’ the most dramatic changes 
Sobhomore during the freshman and 
Beste ioe years. These findings sug- 
that em) previous research findings 
: Year based upon a time interval 
test mae! Where only a pre- and post- 
Neous] € administered, could be erro- 
he sneer preted. 

“special udy of attitudes and values, 
eset is those relating to change, is 
Seount 28 variety of problems not 
Search. pe _in cross-sectional Te- 
ete ресійсаПу, some of these 
Aes sale to the reliability of 
tebrer measurement errors 
ssion phenomena, tests of sig- 


Drop 
dig, 
an 


1 


Th 
De е ге; 
Uae fen reported in this article was 
mated St ursuant to a contract with the 
ent ates Offi 8 
ce of Education, Depart- 


Health, Education, and Welfare. 


tween length of college attendance 
dogmatism, and value orientation, 


to 1,747 Ss as freshmen and 
were still in attendance. The 
f sex and length of college at- 


less stereotypic in their beliefs, less 
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ter directed" in their value 
al purposes, no significant 
de and either the degree 
regardless of time spent 
their attitudes and 
male counterparts. Hy- 
f the findings. 


nificance, and multi 
name a few. 


Although it is so 
that attitudes and values are instilled 


early in life and are most easily modi- 
fiable in infancy and adolescence, cur- 
riculum planners in our colleges and 
universities assume that attitudes and 
values are still modifiable at ages 17- 
22, or older. Furthermore, college fac- 
ulties believe their institutions provide 
experiences which are unique and are 
not found outside the academic en- 
vironment. 
Contradictory evi 
presented regarding t 
lege on student attitudes 
(Jacob, 1957; Lehmann & Dressel, 
1963; Newcomb, 1943; Webster, 1958). 
At the same time, data have indicated 
that from the freshman to senior years 
changes in attitudes and values will 
occur (Jacob, 1957; Lehmann & Dres- 
1963; Newcomb, 1943; Plant, 
ster, Freedman, & Heist, 
hat the degree and extent 
ttitudes and values are 
end upon the nature of 
(Smith, 1955), the type 


ple comparisons, to 


metimes asserted 


dence has been 
he impact of col- 
and values 


sel, 
1962; Web 
1962), but t 
to which & 
modifiable дер! 
the experience 
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and nature of contact (McGuigan, 
1958), the personality makeup of the 
individual (Helson, Blake, Mouton, & 
Olmstead, 1956), the group's approval 
of new attitudes (Rosenberg, 1956), 
and the subject’s perception of the 
outcome (Carlson, 1956). Although the 
importance of the college climate in 
changing attitudes and values of col- 
lege students is apparent in the works 
of Eddy (1957, 1959), Brown and 
Bystrym (1962), Lazure (1959), and 
Newcomb (1962), it has been found to 
be nearly impossible to single out any 
one factor as being responsible for col- 
lege students' attitudes and value 
changes (Lehmann & Payne, 1963). 

One of the major faults of many so- 
called longitudinal studies has been a 
lack of any periodie study of the 
original population between the initial 
and final stages. As Corey (1936) 
pointed out nearly 3 decades ago, the 
only valid manner in which to study 
changes in college students’ attitudes 
and values is by the repeated testing 
of the same students as well as re- 
peated testings with students not in 
college. Barton (1959) recognizes the 
difficulty of having an appropriate 
control group when he comments that 
the comparative panel study, although 
involving problems, is the best design 
unless we are willing to randomly give 
or withhold a college education to some 
groups of young people, 

Although this study does not claim 
to have considered or circumvented all 
of the problems involved in longitu- 
dinal research (e.g., absence of a true 
control group or problems of selective 
mortality), an attempt has been made 
to follow up an original population 
who entered college in 1958, some of 
whom did not attend for 4 years. 

The purpose of this study was to 
investigate the relationship between 
amount of college education and 
changes in attitudes of stereotypy and 


dogmatism, in traditional-value orien- 
tation, and in certain selected attitudes 
and views related to higher education. 
In addition, we were interested in 
studying (a) the relationship between 
changes in selected personality char- 
acteristies and intelligence, and (b) 
whether one sex changed more 
markedly in their attitudes and values 
than the other. In substance, we were 
interested in learning what effect 
higher education has upon the per- 
sonality development of students and, 
on the other hand, whether a non- 
college atmosphere will produce 
changes in the same direction and to 
the same degree as does a college 
atmosphere. 


We do not attempt to identify those 
factors which might have an impact 
upon student personality development- 
Nor are we concerned here with learn- 
ing whether or not college facilitates 


the development of certain behavioral 
traits. 


INSTRUMENTS 


The Inventory of Beliefs (America? 
Council of Education, 1950) was used 0 
measure stereotypic beliefs. A high scorer 
thought to be mature, flexible, adaptive, ай 
democratic in his relationships with oin 
a low scorer, immature, rigid, compulsiva 
and authoritarian in his relationships W! 
others. ince. 

The Differential Values Inventory (тш, 
1957) was used to measure the subien sg 
value orientation. This instrument pen 
subjects’ values along a traditional-emerer 
continuum, Using Riesman’s classification 
one might call the traditional values e 
directed" and the emergent values “оше 
directed” or “other-directed.” 60) 

The Dogmatism Scale (Rokeach, 19 "m 
measured subjects? receptivity to new ide ir 
their degree of open-mindedness, and pus 
authoritarian attitude. High scorers et 
thought to be dogmatic, closed-minded, 49 
unreceptive to new ideas. & 

An Experience Inventory (Lehmann ial 
Dressel, 1963) was designed to measure Бос ев 
and political attitudes as well as attitu а 
towards the objectives of college (vocation 
versus liberal arts), factors important 
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gaini: i i 
em e with students and faculty 
ral impressions of college life. i 


SAMPLE 


T — 

Ned Experimental group was originally to 
ing rer me of those students maintain- 
СУБЕ. MEAE enrollment in college over à 
ut RIVER It was soon obvious, however, 
specificiall 5 udents were withdrawing (more 
erms afi failing to reenroll) for 1 or 2 
college a which they would return to 
lisse Жык unrealistic to eliminate 
exposure t nts from the study since their 
ment) o college—in this case, the “treat- 
that of ean probably no less pervasive than 
erms io canal cnrolling for each of the 12 
Buided by 2 le ue In addition, we were 
Were, for th he fact that these “withdrawals” 
Sophomore he most part, leaving after their 
*hmann & ear, and previous research 
Bt the та Dressel, 1962) demonstrated 
and value ost marked changes in attitudes 
Year, Gan occurred during the freshman 
~or“ enl the experimental group 
aqq бусаг group” as we shall enll i. E 
Бап State UR students who entered Michi- 
Attendance niversity in Fall, 1958, were in 
of the next ү that. institution for at least 9 
е terms, and were registered as 


Studen | 
ts during spring term, 1962. 
of those 


e 
students when group consisted 
58, with ы enrolled as freshmen in Fall, 
enrolled hdrew from MSU, and were not 
i as full-time students at MSU in 


Pring 
{Rother ae Students who transferred to 
one control lege were considered as being in 
tdüivalent group unless they completed the 
ool group ke 10 terms of college? The con- 
i » into bs: divided, for purposes of anal- 
ed of th тее subgroups. Group 1C con- 
MCA of colle subjects who had less than 1 
за p mbi. attendance; Group 2C con- 
ee 2 dete who had more than 1 but 
b Sn ears of college; and Group 3C 
" subjects who had more than 2 
an 3 years of college. 


than 
sisted 
MT 


METHOD 


Tie = 
win 9. ins 
на р: ноу previously described, 
ЧО ОТУ, were ption of the Experience In- 
Lam Werk in administered during orienta- 
ore emale all freshmen (1,436 males and 
exp ity in o pr tenine, Michigan State Uni- 
etimental 1, 1958. The subjects in the 
group were then retested near 


Those 


red 


Sig, 85 c i 
ered ag d ip ара 10 terms were con- 
rt of the experimental grouP- 


TABLE 1 


PosrrEsT SAMPLE CLASSIFIED BY SEX AND 
AMOUNT or COLLEGE EDUCATION 


ч No. returning 
Sample size usable test | % Return 
Group materials 
M Е м F M|F 
Experi- 
mental | 966 | 705 650 | 484 | 67 | 69 
1C 217 | 243 | H0 147 | 51 | 60 
2C 112 | 212| 61 | 104 54 | 49 
3C 141 | 150| 90 | 101 64 | 67 
Total 1436 | 1310 | 941 836 | 66 | 64 


nior year, while those in 
the control group Were retested at about the 
same time by means of a mailed question- 
naire. The analysis of data is restricted to 
those students for whom we have usable 
pre- and postdata. 

Because of initial sex differences, the data 
for the males and females were analyzed 
separately. To test the hypothesis that there 
is no difference jn the degree and direction 
of change in attitudes and values among in- 
dividuals who attended college for varying 
lengths of time, analysis of covariance was 
employed where initial (1958) differences 
among the various educational groups were 
controlled. After the appropriate F was com- 
puted, the Schefié (1959) method of post- 
hoc comparisons was employed in an at- 
tempt to ascertain where the differences 
actually existed. 

To test the hypothesis that there is no 


relationship between changes In attitudes 

ademic aptitude, 
fficient was 
d 1962 test 


computed. I 
initial and final, respectively), regres- 
uted separately 


the end of their se 


and for eac 
trol groups. These regress! 
o predict the | 


then used ti 
n. To compute the 
amount 0 etween 1958 and 1962, 


the difference betwee a 
(1958) and predicted postest (1962) score 

ted for each subject for each of 
These change scores 
correlated with the College Quali- 
st total score obtained in 1958 in 


an attempt to learn whether general aca- 
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demic aptitude is related to certain person- 
ality changes. E 

To test the hypothesis that there is no sex 
difference in magnitude of change in atti- 
tudes and values between 1958 and 1962, 
analysis of covariance was used with the 
initial (1958) test score being controlled. 


RESULTS AND Discussion 


The data relative to changes in 
attitudes of stereotypy and dogma- 
tism, and traditional-value orientation 
among the four educational groups are 
presented in Tables 2-4. Table 5 pre- 
sents the data pertaining to the rela- 
tionship between general academic 
aptitude and change. The data con- 
cerning sex differences in degree and 
direction of change are presented in 
Table 6. 

Reference to Table 2 indicates that 
for both males and females there was 
a marked change in attitudes and 
values between 1958 and 1962, regard- 
less of the amount of time spent at col- 


Means, SrANDARD DEVIATIONS, 


lege. The degree and direction of 
changes might be summarized as fol- 
lows: 

1. There is a significant difference 
among the four groups of males in the 
amount of change in traditional-value 
orientation. Whereas those males who 
had 3 years or less of college became 
more traditional-value oriented be- 
tween 1958 and 1962, their counter- 
parts who completed 4 years of college 
became more emergent in their value 
orientations between their freshman 
and senior years. 

It should further be noted that the 
males in Groups 1C and E underwent 
marked value changes between 195 
and 1962. The difference between the 
groups stemmed not so much from the 
magnitude of change within each 
group, but from the fact that they 
moved in opposite directions in the! 
value orientations (a mean change 0 
+1.94 and —2.40 points, respectively): 


TABLE 2 


F RATIOS ror CHANGES IN ATTITUDES 


AND VALUES BETWEEN 1958 AND 1962 


Level of education group means 
Measure 
т ic 2с зс Е 
Е 
1958 1962 1958 1962 1958 1962 1958 1962 
— 
Males 
Inventory of Be- | 57.03 | 65.82 | поз | ъз | gg 9 
v : . -09 | 7213 | 64.64 | 72.93 | 1 
liefs (13.63) | (12.72) | (14.65) | (13.22) (3.37) | (14.07) | (14.41) | (15.02) 
Differential Values | 34.70 | 36.64 35.74 | 35.82 33.78 34.90 34.81 32.41 | 19.80 
Inventory (T.43) | (8.98) | 6.72) | (воз |, 8) | (739) | (7.04) | (7.08) 
Dogmatism Scale | 172.49 | 155.79 | 172.26 | 151.92 | 167.14 152.42 | 160.63 | шз | 1-29 
(24.84) | (27.99) | (27.03) | (24.34) | (23,89) (23.98) | (25.54) | (22.61) 
Females "uU 
Inventory of Be-| 62.72 | 68% | быз | ом | аз | зу | gs ш 
1 Р .95 78.64 
liefs (11.73) | (13.05) | (14.28) | (13.87) (12.22) | (13.47) ds (14.86) 
Differential Values | 34.38 35.67 32.39 32.80 | 34.51 32.68 34.2 30.39" 
£ А .24 29.63 
Inventory (7.23) | (6.92) | 6-91) | (7.42) | (7.02) | (тюу | esn | (8.79) 
Dogmatism Scale | 108.68 | 151.49 | 164.09 | 148.30 | 163.94 | 149.23 | 169.11 | 146.09 | 0-57 
(25.90) | (23.55) | (28.85) | (23.21) (23.88) | (24.08) | (24.70) | (23.97) 


Note.—Standard deviations appear in parentheses, F is analy: 


*p < 05. 


- ds 
sis of covariance F with 1958 scores used as covari^ 


À 
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TABLE 3 
ScHEFFÉ 
cede diui. COMPARISONS TEST OF 
V ENTIAL VALUES SCORES FOR 
ARIOUs Groups OF MALES 
AND FEMALES 


Groups compared M Sp й 
Males 
2 = © versus | 37.05 
i 32.92 | 0.74 | 5.58* 
20 venne 37.97 
36.18 | 0.46 | 3.89* 
p Versus 37.97 
32.38 | 0.88 6.35* 
ie versus 36.18 
32.38 | 0.86 4.42* 
Females 
30 pes versus | 33.91 
30.41 | 0.20 17.50* 
20 ени 35.26 
32.54 | 0.55 4.95* 
ip versus 35.26 
29.63 | 0.62 9.08* 
ip versus 32.54 
29.63 | 0.64 4.55* 


к Xa — Xp 

* p « Sp 

tional ui» 'This value is not the conven- 

Computed e value with nl and n2 df, but is 
cheffe (1959). by a method found in 


In i 
Ic” 
trad 


ane ums those males in Group 
оле] as a group, towards the 
Ose ma] -value orientation whereas 
th Hr in Group E moved towards 
pa, CORP QUEDA end of the con- 
2. T. 
fe e. = 4 the females, there is à signi- 
ollege elationship between amount of 
Orientation aioe and change in value 
a En "There was some difference 
thecction ue sexes in respect to the 
Yee Tale, change in values. While 
Hors es who had either 1, 2, or 3 


ting 


рее became more tradi- 
ue oriented between 
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and 1962, this was not the case for 
their female counterparts—only those 
females who withdrew during their 
freshman or sophomore year became 
more traditional-value oriented. While 
those males who withdrew from col- 
lege during their junior year became 
more inner-directed, their female 
counterparts became more outer- or 
other-directed. 

Ti is conceiva 
changes were di 


ble, of course, that the 
fferent for the males 
and females because of the differences 
in their experiences. This would be 
especially true if many of the “with- 
drawal” females married, resulting in 
their universe of experiences becoming 
more limited than those of their male 
counterparts. 

3. For both males and females, there 
t relationship between 
amount of college education an 
changes in dogmatism, receptivity to 
new ideas, or an attitude of open- 
mindedness. Although all four groups 
of males and females became less 
dogmatic between 1958 and 1962, there 


is no significan 


TABLE 4 

с COMPARISONS Test OF 
ELIEFS SCORES FOR 

s or FEMALES 


ScHEFFÉ Роѕт-Но 
INVENTORY OF B 


Varrous GROUP 
Groups compared M Sp i 
C and 2C versus 70.45 
3C aa E 77.39 1.20 | 5.78* 
72.38 
Y — 18.08 1.33 | 4.06* 
68.56 
1б fae 12.38 0.96 | 4.00* 
68.56 
T — 78.58 | 1.32 7.50* 
Ka ae Xs 
AAT, 


Sp P 

+» < 05. This value is not the conven- 
e value with n1 and n2 df, but is 
jori by & method found in 


aj = 


comp 
Scheffé (1959). 
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TABLE 5 
CORRELATIONS BETWEEN CQT TOTAL SCORE 
AND CHANGE Scores FOR MALES AND 
FEMALES ON THE INVENTORY OF BELIEFS, 
DIFFERENTIAL VALUES INVENTORY, AND 
DocMaTISM SCALE 


Correlations Between CQT total score 
ап 
оци of 

college Dog- 
attendance у |ЇВсһаше de таба 
score change 

Score score 

Less than 1 61 -09 = —.15 
yr. (1C) (102) (.07) (7.11) (01) 
4-6 terms (2C) 64 =.33%¢ —.09 .29* 
(101) (.00) (.07) (.12) 

7-10 terms (3С) | 135 06 —.13 % 
(150) сї) (—.02) (—.04) 

Seniors (E) 562 —.01 .02 .02 
(430) (.00) (—.01) (.13) 


Note.—First set of fi 

parentheses for females, 
*p < 05. 
"o єлї. 


igures for males; those below in 


words, although both 
males and females in the various 


groups became less authoritarian and 


TABLE 6 
SUMMARY or ANALYSIS ОЕ Covartan 
PosrrEsT Sex Dirrerenc 
TYPIC BELIEFS, TRADITIONAL VALUES 
AND DoGMATISM Using Preresr ' 
Score as COVARIATE 


CE FOR 
ES IN STEREO. 


| F ratios for group 


Measure —————— 
1C | 2C 3C E 
Inventory of Beliefs 0.10 1.10 | 0.30  |37.71*** 
Differential Values In- 5.24* | 1.08 |11.05***/35.05**« 
ventory | 
Dogmatism Scale | 0.45 0.00 | 0.13 [17.4269 
| 
*p < .05. 
*** p < 001. 


more adaptive and flexible between 
1958 and 1962, no one male group 
changed more markedly than аша 
although the converse was true for the 
females. Specifically, the longer 8 
female attended college, the greater ia 
tendeney was for her to become e 
stereotypic in her beliefs. In fact, ig 
magnitude of change occurred 1n 
Somewhat linear fashion. hat 
It is of further interest to note thas 
a temporary plateau was reached be 
tween the female sophomore- aP 
junior-year withdrawals. ib 
It should be readily evident that to 
is very difficult, if not impossible, ga 
isolate a unique factor that would Ed 
plain attitude and value changes © 
cause of the impact and continual ol 
teraction of a variety of oed 
academic experiences, nonacade"" 
experiences, the personality ans 
the individual, and so forth—the re 
searchers must conclude that there e 
changes in certain attitudes and Và m 
associated with college attendan ce 
But they find no compelling evident 
Which leads them to isolate a parti 
lar cause, b- 
5. In those instances where the ir 
tained F was not significant, thus 2 D 
nifying that there was no signifie? ^ 
ifference among the four college 
tendance groups in a particular p 
after 4 years (e.g, for the males 
females there was no significant mm 
in dogmatism associated with amor 
of college education), the directioD — 
Change was the same for all ^ á 
Combining this with the previous E s 
ings, it is seen that regardless of * E 
“mount of time an individual ае? д 
college, the change manifested piam 
years is toward a lessening of ste! p 
Uy pic beliefs and an increase in m, 
tivity to new ideas, Needless to $39? me 
Change was more marked for 80 of 
groups than for others. In the are? n0 
value change, however, there W35 
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clear- 
liene! a pattern evident in the direc- 
шр. hange for the various groups. 
females. the males differed from the 
similarities many respects. The only 
changes p- in the direction of value 
who ait or the sexes were for those 
year — college for less than 1 
бөйдө. betus more traditional-value 
those st ел 1958 апа 1962), апа 
4 years ( ES who attended college for 
ore uid) came more emergent-value 
imas. п ndings appear to be con- 
y Pies. to some of those presented 
of Plant n in agreement with those 
cluded 4. hereas Jacob (1957) con- 
to e college graduates appear 
is chara est a system of behavior that 
We hay cteristic of college graduates, 
Mum seen that some of the with- 
Ype of poe lacs manifest the same 
graduat ehavior as do the college 
acob us in some instances. Although 
(mera шиде that neither courses, 
Ave med nor method of teaching 
values. marked impact upon student 
Was кж it might appear that he 
e Mia when one looks at some О 
Sults of rs presented here—the re- 
1950; r, hm. and other studies (Hall, 
not, ap ehmann & Dressel, 1962) do 
pletely. ATI to support this view com- 
College a in all, it would appear that 
Changes cts as a catalyst to speed UP 
as the that would ordinarily occur 
1962) ee. matures (Plant, 
Pact th ven if this were the only im- 
attitudes college has upon student 
Should s and values, its importance 

Ref not be minimized. 

ere eee to Table 5 indicates that 
Were € two instances (and these 
male wit for the sophomore-year 
lons we sag bea where the correla- 
e hes higher than .25. Although 
Or the a analysis indicates that 
statistically es in Group 2C there is а 
albeit АУ Significant relationship, 
negative, between academic 


aptitude and change in ster i 
beliefs and a significant a po 
tionship between academic aptitude 
and change in dogmatism,’ the magni- 
tude of the correlations strongly sug- 
gest that the relationship is not of any 
practical significance. In fact, one 
might go so far as to say that there is 
no real relationship between general 
academic aptitude and changes in 
stereotypic beliefs, dogmatism, and 
traditional-value orientation for any 
of the groups. These findings support 
those of Plant (1962) who also re- 
ported no significant relationship be- 
tween academic aptitude and changes 
in dogmatism and ethnocentrism. 

It is readily evident from Table 2 
that regardless of sex and amount of 
college education, all groups tended to 
become less stereotypic in their beliefs, 
less dogmatic and more receptive to 
new ideas, and (with the exception of 
the senior males and females, and 
junior-year fem 
traditional-value ori 
1962. It is further evi 
at the net changes be 
1962 that there is no pattern of change. 
In some groups the males changed 
more than the females, and vice versa. 
In addition, there was no pattern evi- 


dent from one test to another. From 
d in Table 6, taken 


in conjunction with that in Table 2, 
to be 


the following ¢ 

warranted: 
]. There was gnificant differ- 
ence between males and females in the 
eshman, sopho- 


three control groups( fres : 
more, and junior-year withdrawals) in 
regard to the decrease in their stereo- 


typie beliefs and dogmatism between 
1958 and 1962. 
cR 

s Actually, 


no si 


the changes are in the same 
direction. The manner of scoring is reversed 
for the Inventory of Beliefs and Dogmatism 
Seale so that a high score on one corresponds 
to a low on the other. ^ 
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2. Female seniors changed signifi- 
cantly more in their receptivity to 
new ideas (lessened dogmatism) than 
did their male counterparts between 
1958 and 1962. | 

3. Both male and female seniors 
became more emergent-value oriented 
between 1958 and 1962. However, the 
female seniors changed to a signifi- 
cantly greater degree than did their 
male counterparts. It is indeed diff- 
cult to explain this phenomenon. How- 
ever, when one recalls that the females 
Stated they were influenced to a 
greater degree than males by such ex- 
periences as “person dated, close 
friend/s, and being away from home 
[Lehmann & Dressel, 1963, р. 183]," it 
is not unexpected that the influence of 
the peers and student-subculture ma- 
trix resulted in females becoming more 
outer- or other-directed than their 
male counterparts. 

Whereas the male junior-year with- 
drawals (3C) became more tradi- 
tional-value oriented between 1958 
and 1962, their female counterparts 
became more emergent-value oriented, 
Once again, the difference in amount 
of change in value orientation between 
the sexes is significant, especially be- 
cause the sexes changed in opposite 
directions. This finding is difficult, to 
interpret inasmuch as both males and 
females were at college for anywhere 
from 7-10 terms. Again, the analysis 
of experiences perceived by these sub- 
jects (Lehmann & Dressel, 1963) as 
having had an impact upon their atti- 
tudes and values is helpful, While the 
males felt that both academic and non- 
academic experiences influenced their 
behavior, the females stated that the 
nonacademic experiences were most in- 
fluential. Furthermore, the five most 
important nonacademie experiences 
mentioned by females related prima- 
rily to the student subculture, and this 
culture matrix, especially for upper- 


classmen, is peer oriented. This ep 
the case, it is to be expected that : 
females would tend to become ms 
emergent or outer-directed in e 
value orientation than would the 
male counterparts. 

4. Although both male and xy 
freshman-year withdrawals (10) is 
came more traditional-value orien les 
between 1958 and 1962, the mid 
changed to a greater degree than 
the females. S 

5. Both male and female pups А 
came less stereotypic in their be S 
less authoritarian, and more gears 
between their freshman and eat 
years. Although there was some bon 
difference when these subjects d 
tested as freshmen in 1958, there bd 
a marked difference in the amoun к 
change that occurred between the “enr 
from 1958-1962. During this У. 
period, the females changed aea 
cantly more than the males. It is me 
ceivable, of course, that the ot 
academic experiences so comm nei! 
mentioned by females resulted in their 
aving more contacts (than did ! or- 
male counterparts) with people ata) 
ing in race, creed, color, persona 
and outlook on life. This being the 
case, we would have expected gre 
females to change to a greater ae 
than the males because of a larger 
verse of experiences, 


Concusions 


"mu 
The magnitude of change in ee 
beliefs, ideals, interests, and atti d i? 
certainly leaves much to be dest av 
terms of the professed aims and o Со 
tives of colleges and universities- om 
lege personnel should not feel T n 
placent about what has peru if 
fact, one might have expected t e 
college does have an impact upon Pr 
Sonality development, the change sist 
the seniors would have been con 


| 
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en 
pen то e pronounced than those oc- 
ee in the withdrawal subjects. 
ina feel that personality 
ens sai through college experi- 
md only be successful with the 
Bote phe above. average in intelli- 
rom, 2 realize that this is not 
ala - ri the case. Although the 
Saves а 5 that the females were 
dens "yea to change than were 
imply à € counterparts, this need not 
аге Neg male attitudes and values 
Various n = not malleable. The 
cantly "wr. e groups did change signifi- 
an in? ut those experiences that had 
those D upon them differed from 
ality Pss influenced female person- 
hen, б elopment. This might suggest, 
aT college administrators at- 
unique pg ta on the experiences 
tempt t o each sex rather than at- 
to all o apply the same prescription 


A 
«tiene attitudes and values are 
easily early in life and are most 
"sii arai in infancy and ado- 
changes’ it is readily evident that 
vs Ks take place from ages 18- 
hat o older. It is therefore imperative 
Трн and universities rec- 
Notion E. facts and discard the 
are not at behavior characteristics 
late to their concern because it is too 
аз the fe о anything about them. Just 
ent ¢ yth that intellectual develop- 
een Кш after a certain age has 
К isearded, so should the notion 
for s € sonality change is not possible 
Howey lege students be discounted. 
8 no ег, a college education, рег 8% 
these Fens in bringing about 
Dee mi anges, although college attend- 
T err ee facilitate this development. 
Isty she college faculties and admin- 
ee ven realize that they are not 
Do w providing a unique ex- 
айтан or their students, but that 
Tight i and the social environment 
ave more impact upon person- 


in 


есес, 
Perie 


ality development than courses and 
formal academic experiences. 
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OVERT VERSUS COVERT RESPONDING AND 
RETENTION BY SIXTH-GRADE STUDENTS 


ROBERT L. CRIST' 
Illinois State University 


A 6th-grade classroom (N = 
Both groups read a programe: 


and the other covertly. Ss were teste 
lso 6 wks. later. 


of each programed text and a 


positests there were slight but nonsignificant 
overt- and covert-responding groups. 
differences vanished, and the overt and covi 
tical scores, Time data clearly favored the cover 
replicative data were obtained with a 2 


24) was randomly divided into 2 groups. 
d text, but 1 group responded overtly 


d immediately upon completion 
On the immediate 
differences between the 
On the delayed posttest the slight 
ert groups had almost iden- 
t group. Comparable 


nd classroom (N — 20). 


T older students and adults, 
Spon рна of overt and covert re- 
avor ing to programed texts tend to 

| a covert responding because of 
Gol rr in time (Cumming & 
is а 1964; Feldhusen, 1963). 
nil к udy seeks to determine whether 
elem, r results will be obtained with 

entary-level students. 


Sit 


METHOD 
Material 


bano from the Coronet 
tem (gg) PADY were used: Our Solar Sys- 
They 25) and Latitude and Longitude (LL). 
tively Contained 331 and 351 frames, respec- 
Progra All tests were congruent with the 
factual ec and measured primarily their 
Were „c Ontent. Tests over а given program 
хен immediately upon completion 
Student pogram and 6 weeks after the last 
had finished the programs. 


Programed texts 


Sub jects 


. Sub 
St 
Th 


h 


classroom (N = 24) had a mean I 
(SD = 156). 


med both el 


a Tough ra 


1 
Th 

t 5 P 

han, ^ author wishes to acknowledge and 


ooms two groups were 


ndomization. 


think Ч 

ше marita Jones and Магу Rozum О 

Ad as alf School for their cooperation 
SSistance, 
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Procedure 

The design is shown in Table 1. It is the 
th classrooms with one exception. 
1 read SS and then LL, while 
and then SS. 


same for bo 
Classroom 


Classroom 2 read LL 
Ss read the programs 20 minutes each day. 


Upon completion of a program, each 8 
immediately took the test covering the 


program's factual content. 
Ss read only the programed texts, and 


no supplementary learning experiences were 
presented. Classroom teachers were always 
in the room, but they were uno 


btrusive and 
provided assistance only when it was re- 
quested. 


RESULTS 
Table 1 shows the group mean 
ores and mean com- 
pletion time for each of the pro- 
gramed texts. Standard deviations of 
all measures are also shown. The 
data for each classroom are treated 


separately. 


achievement sc 


Classroom 1 
Table 1 shows tha 


SS first. The covert- 1 
Group А, had а mean of 85%, while 


the overt-responding group, Group B, 
had a mean of 77%. This difference 
was not significant at the .05 level. 


t both groups read 
responding group, 


Upon completion of SS, groups 
changed response mode and read LL. 
Group А re- 


this program, 


With 
tly and 


sponded over had a mean of 


100 


Rosert L. Crist 


TABLE 1 
Summary оғ MEAN Теѕт Scores AND MEAN TIME FOR COMPLETION OF 
Hes Two PnocRAMED TEXTS 


Classroom 1 (N = 21) Classroom 2 (N — 20) 
i t itude & tem 
Tests Solar system танець: in nde d Solar sys! 
Group Group m 
м | sD! м | sp W EAE E 
т^ | 123 | 90 | 20 
diate posttest A 85° | 19: | 86 | 16 © 87^ | 12 „| 13° 
MEM MEA B 7 |26 | v- | i| p | s OE: 
d posttest A 83° | 160 80 15 Cc 88" Ы al 10° 
impen В | 88 |19 | 7^|1*| D | 3 |13 = B 
Mean time for program | А | 133: | 5ga 162 | 28 [o 1938 | 298 Eo 1^ 
completion B 218 | 53 | 137» | 49» D 273 | 59 
a Covert responding group data, 
; р 
86%, while Group B responded cov- gg, Group C scored 81, and Group 
ertly and had a mean of 79%. This scored 80. on 
difference was not, Significant at the None of the group Lower p 
05 level. the delayed posttest approache 
nificance, 
Classroom 2 nce 
Group C ach 


on LL, while Group D had a Score of 


versed and SS was studi 
increased with Group 
Score of 90% and Grou 
85%. Neither of the 
was significant at the :05 level. 


Delayed Posttest 


The second row of data in Table 1 
Shows mean scores on the delayed 
posttest. Although there Were slight 
but nonsignificant diff 


erences on the 
immediate posttest, group differences 


became even smaller on the delayed 
posttest. 

In Classroom 1, Groups A and B 
had identieal scores of 83% on the 
test covering SS. On the LI, items, 
Group A had a score of 80 and Group 
B, a score of 77. 

In Classroom 2, differences between 
£roups were equally small. On LL, 
Group C had a score of 88 and Group 
D, a score of 86. On the test covering 


Time Data 


e 
The third row in Table 1 shows 
mean program-completion nr ай 
every instance covert responding This 
less time than overt responding. me 
time difference takes its most Өх рей 
form in Classroom 2 with 85. en) 
the covert-group mean was 93 co a 
while the overt mean was 178 p 
The large SDs provide evidence 0 ei 
appreciable differences in the tim 
took Ss to complete the programs. 


CoNcLusioxs for 
Differences in retention pe ay- 
groups Tesponding overtly and the 
ertly were compared. On both ye 
immediate Posttests and the delay“ 
Posttests, no significant difference. 
tween the overt and covert E/O 
Were obtained. iffe" 
he absence of significant, di the 
ences in retention coupled with the 
appreciable saving in time of r 
covert group provide support rade 
Covert responding at the sixth-E 


À 
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leve] "e 
- Replicative data provided by a (Ed.), Educational technology: Readings 
in programed instruction. New York: 


Sec h 
ond classroom provide additional Д 
| Holt, Rinehart & Winston, 1964. Pp. 231- 


5 
“pport for these conclusions. 241 
B REFERENCES FrLDHUSEN, J. F. Тара for teaching mi- 
dp NOM A., & Gorpstern, L. S. The effect - Р Беш Бр, em 
‚ Overt and covert responding on two ы 
(Received May 10, 1965) 
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SEQUENCE OF VOCAL RESPONSE-DIFFERENTIATION 
TRAINING AND AUDITORY STIMULUS- 
DISCRIMINATION TRAINING IN 


BEGINNIN 


G FRENCH’ 


LAWRENCE MACE 
University of California, Los Angeles 


Sequence of speaking and listening in beginning French was studied 
with 108 early elementary school children. Instruction was presented by 


an automatic device in 10 daily 1 


5-min. lessons according to 4 treat- 


ments: (a) 5 speaking lessons followed by 5 listening lessons, (b) the 
5 listening lessons followed by the 5 speaking lessons, (c) 10 concurrent 
lessons alternating speaking and listening with speaking Ist, (d) con- 
current training as in (c), but with listening Ist. At the end of instruc- 
tion, and 2 wks. following instruction, the speaking-1st groups reliably 


excelled the listening-lst groups o 


The question behind this research is 
the role and importance of vocal re- 
sponse learning in the acquisition of 
language. Specifically, the study in- 
vestigates the optimal sequence of 
speaking training and listening train- 
ing in the teaching of French as a 
second language. That is, should a 
person beginning to learn French be 
taught to speak at the same time he 
is taught to listen, or should speaking 
be postponed until after considerable 
competence in listening has been es- 
tablished? 

In this experiment, vocal response 
learning occurred through a process 
of response differentiation as subjects 
(Ss) vocalized different French words 
and phrases. This learning through 
speaking was contrasted with learning 
through listening which occurred with- 
out overt vocal responding; listening 
in the experiment required only dis- 
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n a listening-comprehension test. 


crimination among French utterances 
as auditory stimuli. Three alternative 
possibilities were explored in order to 
investigate optimal sequence of Vo- 
cal response-differentiation training 
and auditory stimulus-diserimination 
training: (a) Vocal training shoul 
be deferred until after the learner has 
had fairly extensive training in which 
French utterances, as auditory stim- 
uli, have acquired considerable dis- 
criminative control; (b) vocal tram- 
ing should occur early, receiving 
special emphasis before extensive 207 
ditory discrimination training; 9? 
(c) concurrent attention should b€ 
given both to vocal response 4? 
auditory discrimination training. _, 
Nelson Brooks (1960), a leading 
proponent of the audio-lingual 20” 
proach to language teaching, таш” 
tains that listening (diseriminatio? 
training) must precede speakin? 
(sound production or vocal respon? 
differentiation). He states: 
The learner's activities may be briefly SUE 
marized as follows:... He is to hear 1140 
more than he speaks, he is to speak only 
the basis of what he has heard. . . [P. 50). 
is the ear that dominates the learning ie e 
use of speech sound. Ear training must СО, 
first.... The second objective is th! 
duction by the tongue and adjacent 
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of the spe 
speech s s а 
recognize [p. M the ear has learned to 


ene (1963) and Lane (1965), in 
еш reviews of the literature 
dp сано in second-language learn- 
айдап "e report a lack of agreement 
анг anguage specialists and а 
the н of experimental evidence on 
an jority of either discrimination 
lenguas or vocal response training in 
that ko learning. However, the view 
iake IScrimination training should 
tiaina a over sound-production 
н ees been accepted almost as 
lis at y many language teachers. 
x аа. also has been followed by 
niti of specialists in programed 
Bid uction as a basis for pre- 
Mores oreign-language programs (сї. 
on, 1960; Sapon, 1961). 

tud е and possible importance of 
learnin production during language 
suns is suggested by a series of 
i igations in speech phonology by 
ibs к (1957) and his co-workers 
Parce ogg a motor theory of speech 
ek Each phoneme of a lan- 
of ph comprises a range Or category 
Physically different sounds which 


S 
vocal responses mediate speech- 
hat tr Perception. This view implies 
lon or LU in speech-sound produc- 
heap fen be important in learning to 
ild phonemes of a language. 
Present Ten learning French in the 
Only ү experiment were taught not 
French. articulate and to distinguish 
Matio phonemes and phoneme com- 
these ns, but also to comprehend 
listening ind patterns. Speaking and 
соте comprehension were define 
Чегет E to the concepts of response 
tiation and stimulus discrimi- 
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nation. Response differentiation oc- 
curred during speaking training as 
E^ mes. to articulate different 
Mironis ped ere e stimulus 
speaking er Ко 

е ing as pictures were 
used to evoke these utterances. Con- 
sequently, through speaking training, 
pictures became discriminative stim- 
uli controlling newly differentiated 
vocal responses. A child in the experi- 
ment spoke with comprehension if he 
made appropriate utterances when 
particular pictures were presented. 
During listening training, stimulus 
discrimination was the only process 
whieh was emphasized: A French 
utterance was presented as an audi- 
tory stimulus, and Ss learned to se- 
lect a certain picture from among 
several alternative pictures. The ex- 
tent to which a child learned to select 
pictures corresponding to particular 
utterances indicated his listening 
comprehension. 

Speaking training required both 
the differentiation of new vocal re- 
sponses and also the stimulus control 
of these responses through discrimi- 
nation; listening training emphasized 
only discrimination. Sequence of vo- 
cal response differentiation and audi- 
tory stimulus discrimination was in- 
vestigated by arranging materials for 
experimental treatments with either 
speaking training first or listening 
training first. The same materials 
were also rearranged to provide 
treatments for concurrent speaking 


and listening training. 
METHOD 


Subjects 


Ss were 108 
41 first graders, 4 


children from six classrooms: 
3 second graders, and 24 
third graders. Many Ss were from Japanese 
homes (56%) or Mexican-American homes 
(21%). However, all Ss were judged by their 
teachers to be capable of following instruc- 
tions in English. None of the Ss had a 
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French language background. The school is 
located in a lower-middle-class socioeco- 
nomic neighborhood in Los Angeles, Cali- 
fornia. 


Equipment 


For presentation of stimuli, a Wollensak 
tape recorder was synchronized with a 35- 
millimeter Kodak Carousel slide projector 
by means of a Kodak Cavaleade programer. 
Groups of four to six Ss sat at tables sepa- 
rated by plywood partitions. They listened 
to taped audio materials through earphones 
and watched visual presentations which were 
projected on a screen before them. The ply- 
wood partitions and the earphones helped to 
limit social interaction among Ss. 


Materials and Procedures 


A 10-day 600-frame program of beginning 
French instruction consisted of five daily 
lessons (300 frames) of speaking training, 
and five lessons (300 frames) of listening 
training. Each lesson, 60 frames in length, 
lasted 10-12 minutes. Through the duplica- 
tion and rearrangement of the 600 frames, 
four sequences of materials were created, one 
for each of the four experimental groups de- 
scribed below. These sequences consisted of 
identical frames arranged in different orders. 

For each of the 300 speaking frames in the 
program, a single picture was projected 
upon the screen. Ss responded by making an 
utterance. Then the correct utterance was 
presented from the tape recorder. The cor- 
rect utterance depended upon the lesson 
being taught. For example, in the early les- 
sons of the program a picture of a large dog 
in a standing position might require the ut- 
terance, “un chien" (a dog), while later in 
the program the required response to the 
same picture might be “Le grand chien est 
debout.” (The large dog is standing.) 

For each of the 300 listening frames, three 
numbered pictures were projected on the 
screen simultaneously. For example, the pic- 
tures for a frame might be: a large girl in a 
standing position (1), a small dog in a stand- 
ing position (2), and a small girl in a sitting 
position (3). A French utterance such as 
“La petite fille est assise” (The small girl is 
sitting.) was presented from the tape re- 
corder, and Ss selected one of the three pic- 
tures on the screen, writing its corresponding 
number on a scoring sheet. Confirmation was 
provided as the number of the correct pic- 
ture (in this example, the number 3) was 
spoken in English from the tape recorder. 
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Ss were assigned randomly to four Hr 
ment groups, 27 in each group. The firs 
group received massed training in speaking 
followed by massed training in listening; 
five daily lessons of speaking training were 
followed by five daily lessons of Hao a 
training. The second group received masse d 
training in listening followed by masse í 
training in speaking; five daily lesson 0 
listening training were followed by de 
daily lessons of speaking training. The thir 
and fourth groups received the two ор 
of training concurrently ; that is, each oft е 
10 daily lessons included both kinds Es 
training. Each day for 10 days the рага 
group received 15-frame segments of pd 
ing training followed by 15-frame gogman 
of listening training involving the same U 
terances. The training for the fourth grouP 
was the same as for the third group, excep? 
that each day the listening-training ad 
ments came before their speaking counte 
parts. To summarize, therefore, the first вез 
third groups received speaking training fire 
while the second and fourth groups receive 
listening training first; the first and seco” s 
groups received massed training in speakin 
and in listening, while the second and thi Е 
groups received approximately concurren 
training in speaking and listening. t- 

Following the four instructional аыр 
ments, a Listening Comprehension Test по 
administered to all Ss in groups of six on ОР! 
school day. The 60 items of this test GNO 
the entire content of the program and Ж н> 
similar to the frames of the listening tr? ^ 
ing with the exceptions that no confirmatum 
was provided, and 23 of the test items cad 
cluded six numbered alternative pictus 
rather than three. Each S's score ОП bei 
Listening Comprehension Test was the Mis 
number of items answered correctly. T di 
same Listening Comprehension Test was rs 
ministered a second time 2 weeks follow! 
instruction in order to measure retention n, 

During the 2 weeks following instruct 
each S was given an individual spe of 
test with 80 items similar to the frames ig 
the speaking portions of the instructio 
program except that no confirmation king 
provided. For each item of the Spes еп, 
Test a pieture was projected on the soring 
and S spoke into a microphone recor red 
his utterance. A native French judge m hog 
each S's speaking performance, allow e 
points for the appropriateness and сотр the 
ness of S's utterance as a response ї0, ү 
picture. An S's score on the Speaking ible 


was the total points achieved from а рО®® 
324 points, 
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finds TABLE 1 
TED TREA' T А 
естен ааа AND STANDARD DEVIATIONS FOR SCORES ох TEST: 
MPREHENSION, RETENTION OF LisTENING COMPREHENSION EN 
AND SPEAKING ACHIEVEMENT i 


Listenii i istenii 
еба comprehension Retention of song зы. 
Р м SD M SD M SD 
5 peaking lessons before 
f ening lessons 45.73 13.78 49.35 12.22 152.75 79.34 
акар lessons before 5 
à ing lessons 36.25 16.28 36.64 16.97 138.79 76.35 
pom before listening in 
А me lessons 41.03 13.48 43.87 12.30 114.32 83.31 
thee before speaking in 
ame lessons 39.78 16.44 39.85 16.62 137.47 59.64 


RESULTS 


betty correlation of .92 was obtained 
Prehenei scores on the Listening Com- 
end E po Test administered at the 
es ne instructional program and 
tered on on the same test adminis- 
this о Weeks following instruction; 
Tetest Orrelation suggested that the 
om aay of the Listening 
The > d ension Test was satisfactory. 
the res reliability of the scores 
7, indi peaking Test was found to be 
аз мнр that the Speaking Test 
iso reliable. 
не А correlations of .49 or higher 
and thet ed between Ss’ mental ages 
onse 1r scores on the three posttests. 
ft RE the means for each 
Posttest wr treatment groups on each 
Table 1 Were adjusted for mental age. 
and st presents the adjusted means 
th Mer dard deviations for each of 
ex ment groups on each posttest. 
fo pay of covariance were per- 
tests on the scores of the three post- 
Ate, Using mental age as a covari- 
each ót Study the effect of sequence on 
9f the the three posttests, the scores 
Speaking-first groups were com- 


pared with the scores of the listening- 
first groups. A statistically significant 
difference (F = 4.56, df = 1/108, p < 
.05) was found for the effect of se- 
quence on the Listening Comprehen- 
sion Test administered at the end of 
instruction. This difference favored 
the speaking-first groups, suggesting 
that the speaking-first treatments were 
more effective than the listening-first 
treatments in producing performance 
in listening comprehension immedi- 
ately following the program of instruc- 
tion. 

Also, a statistically significant dif- 
ference (F = 11.95, df = 1/108, p < 
.01) favored the speaking-first groups 
when the Listening Comprehension 
Test was administered 2 weeks fol- 
lowing instruction. This finding sug- 
gests that the speaking-first treat- 
ments produced superior retention of 
performance in listening comprehen- 
sion. 

The effect of massed training, be- 
her with 1 week of speak- 
ing training or 1 week of listening 
training, was compared with the ef- 
fect of concurrent daily training in 
king and listening. No statisti- 
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cally significant differences were found 
for either the Listening Comprehen- 
sion Test or the Retention Test be- 
tween scores of the two groups which 
had massed training and scores of the 
two groups which had concurrent 
training. Also, no statistically sig- 
nificant differences were detected on 
the Speaking Test for either the effect 
of sequence or the effect of massed 
versus concurrent training. 


Discussion 


The results indicate that sequence 
of speaking and listening training 
was important in the experiment: 
The speaking-first treatment groups 
performed better than did the listen- 
ing-first groups on both administra- 
tions of the Listening Comprehension 
Test. 

One explanation for this finding is 
that speaking training produced some 
degree of listening proficiency, and, 
consequently, the groups which re- 
ceived speaking training first may 
have begun listening training with 
some part of the listening task al- 
ready learned. This advantage may 
have resulted from (a) increased 
discriminability of stimuli due to 
articulatory and verbal labeling, and 
(b) reversibility of stimulus and re- 
sponse terms. 


Articulatory and Verbal Labeling 


Articulatory responses were taught 
in the experiment during speaking 
training: A French utterance was 
presented as an auditory stimulus, 
and each S responded by voicing the 
utterance himself. This training may 
have facilitated auditory discrimina- 
tion for the speaking-first groups dur- 
ing subsequent listening training. 
Liberman (1957) proposes that an 
articulatory response may be learned 
as a label for a range or class of non- 
identical acoustic stimuli, and that 


distinctive proprioceptive stimuli from 
the articulatory response may ad 

to the stimuli of a class of speech 
sounds making them more distinctive 
or discriminable. Holland and Mat- 
thews (as cited by Lane, 19652) also 
suggest that vocalization may play ап 
important role in speech discrimina- 
tion by supplying supplemental kin- 
esthetic stimuli. The present results 
support this view that learned articu- 
latory responses facilitate subsequent 
auditory discrimination. 

Considerable research suggests that 
verbal labeling of stimuli may W7 
crease the discriminative control of 
these same stimuli under other ar 
cumstances (cf. Albert, 1959; Goss & 
Greenfield, 1958; Norcross & Spike" 
1957; Reese, 1960). Through speak- 
ing training in the present exper 
ment, different French utterances Бе” 
came verbal labels for certain visu? 
stimuli. For example, 5 learned tO 
say "La fille est assise" when & pie- 
ture of a girl sitting was presente 2 
and “La fille est debout" when ? 
picture was that of a girl standing- 
In this example, the sitting-standinÉ 
aspect of the picture became 4 е 
trolling visual stimulus for the yous 
response. This training could have 
been useful to the speaking-first ^. 
during listening training when 006 9 
these utterances was presented, a 
Ss were required to select a P1 e 
from among alternatives on the bas” 
of a particular visual property. 


Reversibility of Stimulus and P 
sponse T'erms 


Research in paired-associate lear? 
ing has demonstrated that whee в 
response, B, is learned to a stimt lus 
A, a reversibility of these ВЧ p 
and response terms occurs so t 
as a stimulus is likely to evoke 
response. This effect has bee? . 
scribed as backward or reverse 9? 
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cation in which learning A > B 
amna taniny establishes a B — A 
TED e (cf. Cramer & Cofer, 
Kield Feldman & Underwood, 1957; 
Po men & Horton, 1960; Mate 
Moe” 1965; McCormack, 1961; 
i ock, 1956). Although stimulus- 
pida reversibility itself needs ex- 
сю this effect may be used to 
fis unt for the present findings. Dur- 
pr eite training, the speaking- 
cal i learned utterances as vo- 
пее green Ese visual 
for om Subsequent, listening training 
a n speaking-first groups required 
uS id of these stimulus and re- 
ecting terms; Ss responded by se- 
When | a picture from alternatives 
an the utterance was presented as 
Й E stimulus. Through re- 
the Hity during speaking training 
ao Speaking-first groups may have 
bom listening competence prior to 
omae training. Initial listening 
PE сета could have led the speak- 
rst groups to superior listening 
Performance. 
S s results, accounted for by label- 
mes d reversibility, suggest that the 
liste effective sequence for teaching 
Кеш. comprehension is speaking 
Lor ing before listening training. 
thet generally, the results suggest 
ion early vocal response differentia- 
euin. be important in language 
elusi AB The experiment is not con- 
ous YS, however, since two extrane- 
gee ае for the findings are 
=. ble. These are: (a) possible dif- 
гесе сев in group morale, and (b) 
Ney of discrimination training. 


S А 

еар irst Group Morale 

Spoke oe speaking training, six Ss 

Sented aloud as a picture was pre- 

Si eee. the screen; social interac- 

encour nong Ss at this time may have 

Tale aoe competition and high mo- 
- Higher morale for the speaking- 
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first groups may have carried over 
during subsequent listening training 
leading to superior listening per- 
formance. In order to rule out this 
interpretation, it would have been 
necessary to administer treatments 
individually to Ss. 


Recency of Discrimination Training 


When the Listening Comprehension 
Test was administered at the conclu- 
sion of instruction, one speaking-first 
group had just completed a full week 
of listening training. Therefore, this 
group may have had less time to 
"forget" before the Listening Com- 
prehension Test than the listening- 
first group which had completed 
listening training а week earlier. Al- 
though a difference between the 
groups held up well 2 weeks follow- 
ing instruction when the Listening 
Comprehension Test was adminis- 
tered a second time, the effect of 
recency of training cannot be dis- 
counted. No control appears com- 
pletely satisfactory for obviating this 
recency interpretation. However, if 
all testing had been postponed for 
several weeks or more following in- 
struction and the same findings had 
prevailed, then an explanation 1n 
terms of recency would be less credi- 
ble. 

Differences among the groups on 
the Speaking Test would have helped 
greatly in finding the optimal se- 
quence of speaking | and listening 
training. If the speaking-first groups 
had excelled on poth the Listening 
Comprehension Test and the Speak- 
ing Test, the recency explanation 
could be discounted, and speaking 
first could be recommended generally. 
On the other hand, if the listening- 
first groups had excelled on the 
Speaking Test, while the speaking- 
first groups excelled on the Listening 
Comprehension Test, then the effect 
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ncy would be clearly sug- 
os rd more sensitive Speaking 
Test was needed in the experiment. 

In conclusion, the experiment gives 
no support to the formal tenet of the 
audio-lingual approach to language 
teaching (Brooks, 1960, P. 107) which 
recommends that training in listen- 
ing comprehension should precede 
sound-production training. In fact, in 
terms of the results obtained, the 
experiment would support the oppo- 
site conclusion; that is, for most ef- 
fective acquisition of listening-com- 
prehension skills, differentiation of 
utterances as vocal responses through 
speaking should precede training 


whieh emphasizes only auditory dis- 
crimination. 
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BLACKOUT RATIO AND OVERT RESPONSES IN 
PROGRAMED INSTRUCTION: 
RESOLUTION OF DISPARATE RESULTS’ 


FREDERICK D. KEMP ax» JAMES С. HOLLAND* 
Harvard University 


38 он measure, for the degree to which material is programed, 
Kewl ied to 12 sets of material used in previous studies of overt vs. 
eki -— in teaching-machine programs. Blackout ratios (96 
= es s which could be removed without affecting error rate) 
Ps B rom 11.1%-74.6%. The lowest 4 ratios (11.1%-25.4%) were 

r programs which previously demonstrated an advantage for overt 


responding. The remaining 8 ratios (31.0 
which previously yielded no response-mo 
have failed to find an advantage for overt respon 
rial used was largely unprogramed in that answers were 


%-74.6%) were for programs 
de difference. Many studies 
ding because the mate- 
not dependent 


upon much of the content of the material. 


inae rationale underlying programed 
перо ion demands that there be а 
гае to each item in a program. 
e i also been suggested that the 
а. ыд Should be overt to assure 
Freee occur without fail. 
saarel ae the now voluminous re- 
iie iterature contains inconsistent 
iras Íor comparisons of posttest 
After mance after overt response and 
м either the reading of completed 
AE or "thinking" the answer. 
Statice: studies have failed to find a 
Sed significant difference, but 
ae ave confirmed the advantage 
relevant ert answer. (For a review a 
ba ge studies see Holland, 1965.) 
н азоп suggested for the frequent 
€ to obtain a significant differ- 
ы 


1 
ропе esearch reported in this paper was 
and, El by the Air Force Systems Com- 
ciences еш Systems Division, Decision 
AF19(65 aboratory, under Contract No. 
The pa 8)-2404 with Harvard University. 
Document, is identified as ESD Technical 
Parna atary Report No. ESD-TDR-65-355. 
Reeds of Production is authorized to satisfy 
; The т the United States Government. 
ities frome was conducted in the facil- 
Programm, ed by the Harvard Committee on 
Y the С ed Instruction which is supported 
"Now arnegie Corporation. 
at University of Pittsburgh. 


ence has been that "poorly" pro- 
gramed ог unprogramed material 
may have been used in many of the 
studies. This point received some 
support in the results of an experi- 
ment (Holland, 1964) in which a 
program Was deliberately altered by 
leaving blank for the subjects’ (Ss’) 


completion (a) words that could be 


supplied quite easily, but for reasons 
relatively unrelated to the principal 
content of the items, and (b) words, 
which, although relevant to the criti- 
cal content of the item, could not 
readily be given by Ss. Neither of the 
experimental programs showed sig- 
nificant differences from reading com- 
pleted statements. However, the nor- 
mal program, in which the answer 
term was both relevant to the critical 
content and was to be provided by 8, 
showed а significant advantage for 
overt responding. 

To determine experimentally 
whether response mode or other char- 
acteristics of programed materials 
are important, jt is necessary to have 
an unequivocal, preferably quantita- 
tive, experimental definition of the 
degree to which the material used is 
programed. Such a measure has re- 
cently been developed (Holland & 
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Kemp, 1965). This measure, the 
blackout ratio, consists of obliterat- 
ing all material in a program which 
can be removed without affecting 
error rate. Such material is unpro- 
gramed in the sense that answers are 
not contingent upon (nor dependent 
upon) mastery of that material. The 
quantitative index is expressed as 
the ratio of the number of words 
that can be so obliterated to the total 
number of words in the material. By 
applying the blackout technique to 
programs used in previous response- 
mode studies, this experiment evalu- 
ates the claim that the frequent use 
of relatively unprogramed materials 
has caused the existing ambiguity in 
the literature regarding the value of 
an overt response. Excluding high- 
error-rate programs from considera- 
tion, programs which revealed an ad- 
vantage for overt responding would 
be expected to have a lower blackout 
ratio, on an average, than programs 
which failed to show a difference 
between overt and covert responding. 


METHOD 
Blackout Technique 


The blacking-out procedure consists of 
physically obliterating those portions of an 
item judged to be unnecessary to the cor- 
rect response. Generally, only whole phrases 
are blacked out; articles, prepositions, and 
auxiliary words are blacked out only with 
the full phrase. In calculating the blackout 
percentage, the unit used is, simply, words. 
Blanks are not counted as words and are 
never blacked out. Where diagrams, mathe- 
matical expressions, etc., appear in an item, 
they are treated somewhat arbitrarily, but 
consistently, as to what constitutes a “unit.” 
Diagrams are not blacked out (only the 
Cardiology Program contained many dia- 
grams); thus, the blackout percentage refers 
strictly to verbal content. The supplemen- 
tary materials (“panels”) are not blacked 
out. 

Figure 1 presents several samples of the 
same items from different programs, in 
normal and blacked-out forms. The range 
of the blackout ratio is apparent. 


After blacking out the material (cover- 
ing it with black crayon), the blacked-out 
and un-blacked-out versions are tested with 
two groups of Ss to validate the assumptions 
that answers are independent of the ma- 
terial obliterated. When error rate has been 
significantly influenced, the blackout must 
be revised, and the new blackout tested un- 
til error rate is unaffected. In practice, the 
first blackout is often adequate, and there 
is seldom more than one revision required. 


Materials and Apparatus 


Table 1 lists the programs used and the 
abbreviations by which they will be referre 
below. The Cardiology Program is listed on 
two rows because the Cummings and Gol A 
stein (1962) study classified the items ine 
those with verbal answers and those with 
pictorial. They had expected that overt re- 
sponding would be important for рісіопа 
but not for verbal items. However, they 
found an advantage for overt responding 1? 
both types of items. nui 

To keep the size of the experiment м 
manageable and economical limits, p- 
blackout ratio was determined usually, = 
the first 50 items, but special considerations 
led to special exceptions. Only the 17 item 
in the appendix of a report (Roe, Meer 
Weltman, & Leeds, 1960) were available 10 
the Statistics Program. The full Cardiology 
Program was only slightly longer than u 
100 items needed to obtain 50 pictor. 
items; 375 items were used for each of i 
two forms of the Analysis of Behavior © 
cause of the need for this information ! 
relation to another experiment (Hollanc: 
1964). Five programs—AB-A, АВ-В, IBM, 
MG, and Elect (Z)—used supplementary 
material in the form of panels, which We 
given to S at the appropriate times. i, 

All programs, in Biber uid gr blacked- 
out forms, were used in teaching machina 
in the Harvard Self-Instruction Room “phe 
supervision of a trained attendant. " 
AD-A and the AB-B programs were kee 
in the Harvard disc machine; all m 
were used in the Rheem Didak #501 7, 
chine. Some of the cardiology frames wi 
too large for this machine and require 
large jer space. For these items 
special booklet was arranged. 

All blackouts in the emendi study were 
performed by the senior author. An ® 
tional experiment served as a check ор 
objectivity and on the objectivity 01. d 
method itself. Ten naive Ss were bar 
using a tcaching-machine program ОП 
blackout technique and compared with 


а 


опе 
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Normal 


Blacked out 


Stat 
This 
bis aoe shows Cells 1 and 2 
ed і 
п all the possible ways they can 


be fill 
ed. C 
маза; ell 1 = 3 ways, Cell 2 = 2 


— 
SOON 


We can 
dies. н in thís illustration that 
8 
"tA с one way (ball) left to fill 
flus when cells 1 and 2 are filled. 
й » there are 3 x 2x 1 
ich i 
3 balls can #111 3 cells. 


ways in 


fard~v 

In th 
ем 

which ord "electrocardiogram" indicate 
Parts mean "electrical", "heart", 


ап ". 
record," 


AB-A 
Ifa 
Teg; 
Teinfor Ропве which has been accidentally 
c 
before = does not happen to occur just 
se 
tt wind pe cond operation of the magazine, 
A 
AB- 
Ifa 
forced do, which has been accidentally rein- 
ое 
3 not happen to occur just before 


a Sec. 
Ond o 
MM be M codi of the magazine, it 


fu 3 x 2 х 1 o 


"electrocardiogram" indicate 


In the word 
"electrical", "heart", 


which parts mean 


and "record." 


ponse which has been accidentally 


If a res 
just 


reinforced does not happen to occur 


before a second operation of the magazine, 


it will be _—- 


which has been accidentally rein- 


and blacked-out 


. 1. x 
Sample items from four programs as they appeared in the normal 


Versions 


Nother 
сеа аа with the present “blackout 
raais б га the blackout ratios for seg- 
bi "s aaéigned b programs. The blackout 
iy with y 9 of the 10 Ss correlated 
mation bet one another. The highest cor- 
288. ih Ween naive Ss’ blackouts was 
Eus 9 of Se 45 coefficients 26 exceeded .80, 
on The 9 Ss наций involved the deviant 
es another 7 ose blackouts correlated with 
ith the bl also yielded high correlations 
ackouts of the blackout special- 


ist in the present study (coefficients ranging 


from .614-.923). 


Subjects 
A total of 195 undergraduates at Harvard 
University ап Radcliffe College served as 
Ss. They were paid a flat fee for participat- 
jment, based on a rate of 


ing in the exper 
$1.50 an hour for а number of hours esti- 
mated to be sufficient for relatively slow 


workers. 
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TABLE 1 
EXPERIMENTAL PROGRAMS 
. Difference 
Previous experiment(s) No. р) m 
РО vi vs. i re 
Program and abbreviation Author of program in items |di! iem 
uA ey 
i i i i & Goldstein | 57 D 
Diagnosis of Myocardial In- | Basic Systems (unpubl.) Cummings 
The reton Pictorial (1962) : 
PO Enoia of Myocardial Tn- | BasloSystems (unpubl.) | Cummings & Goldstein | 124 
farction—Verbal (1962) г 
pees of Missile Guidance TMI (Air Force—unpubl.) | Cartier (1963) 100 
Analysis of Behavior—Version A | Holland & Skinner (1959) Holland cc 1090) 375 D 
be, illiams 
Genetics Kormondy (unpubl.)* C & Van Atta 50 ND 
Electricity U.S. Navy basic electricity | Zuckerman et al. (1981) so | ND 
Elect (Z) course, Zuckerman, Mar- 
shall, & Groesberg (1901) |. ND 
Binary Numbers Silverman & Alter (1901) Silverman & Alter (1961) 50 
perimental Programmed Course:| IBM (unpubl.) Hartman, Morrison, & Carl- | 41 ND 
IBM Sorters son (1963) 
IBM 
Basic күну Silverman & Alter (1961) Silverman & Alter (1901) 50 ND 
Elect 
Son В) а, & Functions Eigen (1961) Lambert, Miller, & Wiley 50 ND 
Sets я (1902) ND 
Anal vais of Behavior—Version B | Holland & Skinner (1959) Holland (1960, 1964) 375 
Statistics Roe et al. (1960) Roe et al. (1960) i | AP 
tat 


a This program is considerably different from a published program which was based on it. 


Procedure 


The basic procedure involved a com- 
parison between 15 Ss taking the normal 
version of the program and 15 Ss taking 
the blacked-out version. Most of the data 
were collected in the summer of 1964. At 
that time, 30 Ss took AB-A; 30 took AB-B; 
30 took Card, Elect (Z), and Sets; and 30 
took IBM, Genet, Elect (S), and BN. 
Earlier, 30 Ss took Stat (Spring 1964), and 
30 took MG in the fall of 1965. 

Preliminary data indicated the necessity 
of revising the blacked-out versions of four 
programs—Stat, IBM, Elect (Z), and Sets. 
This was done in the fall of 1965, when 15 
Ss took the revised version of all four pro- 
grams. The data were then compared with 
the data from the normal program which 
had been gathered earlier. 


RESULTS AND Discussion 


Blackout Validation 


Table 2 presents the medians and 
quartiles of the percentages of correct 
responses on the various programs 
for the normal and blacked-out ver- 
sions of each. Differences in percent- 


age points between normal p 
blacked-out versions range from Я 
(Stat) to —32 (Card V), but ^0 
program showed a significant T^ 
ence in error rate, as measured by be: 
Kolmogorov-Smirnov two-sample tes 
one tailed (Siegel, 1956). 


TABLE 2 
Percent Correct on EXPERIMENTAL 
Programs: MEDIANS AND INTER- 
QUARTILE RANGES ror NORMAL 
AND BLACKED-OuT VERSIONS 


Normal Blacked-Out 
Program terquat” 
А Interquar- i ile range 
Median ‘tile range [Medinn tile гё 
88-0 
Card P 75.4 | 54.4-86,4 | 77.2 Dri. 
Card V 68:6 003-833 | 71.8 | 9-5 94.2 
MG 93.0 | 92:0-06.5 | 93.0 | 87.2282.0 
AB-A 78.0 | 61.7-88.4 | 78.4 | 10-9 90.5 
Genet 82.0 | 77.5-94.0 | 82.0 | $y 91.0 
Elec (Z) 84.0 |73.0-94.0 | 84.0 |72: 5-92.0 
N 90.0 | 81:5-06:0 | 88.0 | 77-2-01.8 
IBM 85.4 | 807-0002 | 88 | 07.0805 
Elect (8) 82.0 | 69.5520 | 70.0 | 01.5-08-0 
Sets 98.0 | 96.0-100.0 | 94.0 | gy p-03-5 
AB-B 93.6 | 85.3-97.2 | 91.7 | $5.9-95- 
Stat 94.1 | 82.4-95.6 | 88.2 


Note.—No p’s reached significance. 
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which have showno d 

Li 
favoring over! response. AA 


=“ 
ig" 


Programs which hi 
ve sho 
mode ddtarence. PER ARRAN 


Par Cant Blocked Out 
8 


Card -P 


Сау — MG 


PROGRAM 


Fro. 9 
Sets of m 
Tatios,) 


Percenta, 
Я ge of Ње total number of 
m 

aterial. (The programs are arrange 


Blackout Ratios 


Fi 
material p shows the percentage of 
Programs еа out of each of the 
Previously hose programs which have 
ng dta shown a difference favor- 
While th responding are darkened, 
erence ose which have shown no dif- 
Yieldeq are plain. The 12 programs 
ratios (L wide range of blackout 
the pi ee Aside from 
Glisten stat te ch of this study, it 
€asure "e the blackout ratio as à 
ату оо of program adequacy сап 
Measure msiderably. In theory the 
in pr can vary from 0%-100%, 
unlikely а the extremes are most 
Centage - However, a range of 64 per- 
Whice acus among 12 programs 
explore ic not specifically chosen to 
the black e range is promising for 

is ee measure of programing. 

hat ee oe apparent from Figure 
iuh the toe sets of material from 
T east material could be 
re those which in previous 


words which were blacked 
d in rank order from lowest t 


out of each of the 12 
o highest blackout 


research yielded results favoring an 
overt response. These had blackout 
ratios ranging from 11.1-25.4, while 
the other eight sets of material, for 
programs previously yielding no dif- 
ference between overt and covert re- 
sponding, had blackout ratios ranging 
from 31.0-74.6. In short, the degree 
to which material is programed is 
related to whether or not overt re- 
sponding is important. If the response 
is unrelated to the critical content 
on which S is later to be posttested, 
it makes no difference whether, much 
less how, he responds, but if S is 
tested on things which have served as 
a contingency for correct responses, 
overt responding is important. 
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According to Flavell, functional fixedness varies with the number of 


unusual uses experienced in the training perio 
lty design. The present study focused on 


are ambiguous owing to a fau 


his limiting condition( i.e., a switch experien 
usual functions), but varied the position ( 


Switch experience. 115 female university stu 
to Flavell's hypothesis, if the switch func- 


functional fixedness was manifested 


to 1 of 5 groups. Contrary 
tion occurred Ist in the series, 


even though 4 unusual uses were subseque! 
ture or gestalt of the training experiences 
on the position of the switch function. A 


tion of the results was given. 


d. His supporting data 


ced in its usual and 4 un- 
ist, middle, last) of the 
dents were each assigned 


ntly experienced. The struc- 
was said to vary depending 
phenomenological interpreta- 


rr à 


n Р 
i igi fixedness is usually 
m РЫ with other "set" phenomena 
ед. еш solving and is interpreted 
Mine йе in which previous ex- 
proble € has a negative effect on 

abi solving behavior (Birch & 
Biber tat 1951). For example, if a 
Pliers 3 (S) uses a pair of pliers as 
confro in one situation and is then 
бу nted with a situation in which 
bond be used as a weight, the 

B pr ib plier experience will reduce 
as a va ability that S will use them 
appro eight if there is another equally 
can x ini object available which 
the m this function. Presumably, 
forces ub the pliers as pliers rein- 
Comes Fes function such that it be- 
Possible D to the object, and other 
ess ay et agen are made relatively 
out iket able. Jensen (1960) pointed 
Pretati the above “negative” inter- 
on th on places too much emphasis 
As e object and not enough on 5 

erleau-Ponty (1962) states, 

tion: a three terms in any percep- 

ject, an a on a ground for a sub- 

eged e the perceiver has a privi- 

gure e in that what appears as 

Бу, 9T ground is greatly influenced 
18 purposes. 

e meaning of a familiar object 


there д 


stands out as a figure against a back- 
ground of discoverable meanings. The 
emphasized and discoverable mean- 
ings of an object are correlated with 
its physiognomy or general shape and 
style. Very often, with regard to an 
unfamiliar object, we know what it is 
not before we know what it is, and 
this negative knowledge is based pri- 
marily on its physiognomy—its size, 
shape, weight, etc! The usual two- 
object functional-fixedness design 
emphasizes the “figure” (known) 
meaning? of one object, and the en- 
suing problem situation compares 
the ease of finding а “ground” (dis- 
coverable) meaning of the previously 
emphasized object with the ease of 
finding such а meaning in а previ- 
ously nonutilized object. The control 
group is used to demonstrate that the 
two objects have similar discoverable 
meanings in the problem situation. 
1 Husserl's views 00 typification and rele- 
uns M discussed by Schutz (1962), are 
germane. However, the extended treatment 
these ideas is not war- 


»quired to present. 
Es Һе scope of the present paper. 


ranted by t É 
2 Jt should be noted that an experience 
emphasizing & nonessential or unusual use 
ke less discoverable the 


docs not hide nor ma 
common Or essential 
(van de Geer, 1957). 


function of an object 
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TABLE 1 


AMBIGUITY OF THE PREPROBLEM TRAINING 
ASPECT OF FLAVELL’s DESIGN 


Proportion of Ss 


i iencing the No. unusual 
кешл! object in its usual uses experienced 
Ei 1.00 0 
Es -50 1 
E: -33 2 
Ey .25 3 
Es -20 4 


The purpose of the present study 
was twofold. The first purpose was to 
determine whether the significance or 
meaning of several experiences with 
an object varied depending upon the 
total structure (gestalt) of these ex- 
periences. 

The second purpose was to test a 
consequence of Flavell, Cooper, and 
Loiselle's (1958) hypothesis that 
there is a Progressive decrease in 
functional fixedness as the number of 
novel, unusual functions experienced 
in the preproblem training period is 
increased. Although Flavell claimed 
experimental support for this hypothe- 
sis, a flaw in his design renders his 
data ambiguous, Specifically, it is not 
known whether his results reflect the 
number of novel functions experi- 
enced or the position of the usual 
function in the series of preproblem 
experiences (ie., the gestalt of pre- 
problem experiences), 1 can be 
seen from Table 1 that there is an 
inverse relationship between the num- 
ber of unusual uses experienced and 
the proportion of Ss in a given ex- 
perimental group who experienced the 
usual function first. For example, 
one-half of the Ss in Flavell’s Е, 
group experienced the usual function 
first (along with one unusual func- 
tion), whereas only one-fifth of the 
Ss in his Es group experienced the 
usual function first (along with four 
unusual functions). 


In the present study the tes 
of the usual function in the prepro d 
lem series of experiences was gane 
while keeping the number of u 
uses constant (four—as in m Я 
limiting condition). If Flavell | "А 
pothesis is correct, the position o ^ 
usual function will have no em 
and the experimental groups will ү 
differ in the incidence of functional 
fixedness, Furthermore, the we ci 
mental groups will not differ {гоп s 
control group; that is, there wi the 
no functional-fixedness effect. If es 
position of the usual function is will 
portant, the experimental groups tion 
differ, and this will call into ques 
the validity of Flavell’s conclusions. 


Метнор 


Subjects 


n 

All Ss were female students enrolled 7 
an introductory educational poen fornia 
course at the University of Са ап! 
(Davis). The majority were juniors ES 
seniors intending to obtain a teaching: 
dential. Each of the 115 Ss was TE 
one of five groups in order of appear& 
the experimental room. 


e at 


Experimental Groups 


Р jor 
The control group's Ss (C), without? Hoo 
experimental training, were given a УлтоЫет 
efie Maier (1983) two-string problem 
Which could be solved only by using €^ i5 
microswitch or a small relay as & We vl 
tie to one of the strings. This group alence 
to demonstrate the functional equiv two- 
of the two available objects for te, line 
String problem and to provide the bao ixed- 
by which the presence of functiona 
ness was assessed. E), the 
Ss in Experimental Group 1 (0), 


Ў two- 
: ч a 
conventional experimental group dig m, eX 
object functional-fixedness para БА were 
Perienced the switch as a switch a 


i гор” 
then confronted with the two-string P 
lem. In the preproblem training pen all ex^ 
members of groups Ez, Es, and m in four 
Perienced the switch in its usual an. experi 
unusual functions. Ss in Group А 
enced the switch function first, third 9" 
Group E; experienced it in bare d 
middle position, and those in 
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уша it in the fifth or last position. 
ane ilization, experiences were pre- 
NRI a Ss as simple problems or tasks 
кез = were to perform by using the 
з О о S encountered the relay prior 
then e two-string problem.) These Ss were 
ler aon with the two-string prob- 
sini е five preutilization experiences en- 
E ered by the members of groups Es, 
V o, Е, follow: (a) Make a 23-volt 
of (be a simple battery circuit go on and 
бш) Connecting the microswitch into the 
Ex "s (b) draw a design comprising five 
straight lines (by using the switch as a 
or Н edge); (c) use the switch as a holder 
Small RES cleaner. (by inserting it in a 
re hole” е in the switch); (d) use the switch 
sition (by piece of cardboard in a tilted po- 
and ( x using the switch as a support); 
holder (b use the switch as a paper-clip 
y 15 (by inserting it in the space provided 
pecia switch contacts). The four non- 
aa functions were randomly ordered 
periem given series of preutilization ex- 
of the es. It is assumed that the position 

2 switch function contributes more to 
ene of the preutilization experi- 
ized ib бова the sequence of nonspecial- 
and st nctions—just as the general shape 
tribut Tucture of a pencil sharpener con- 
color Ti more to its meaning than does its 
carries hat is, the essential function (switch) 

an 3, lore weight in defining the gestalt 
ciali true of the nonessential (nonspe- 

zed) functions. 


Buffer Task 


bero, decrease any apparent connection 
e "a preutilization experiences and 
the Preutiliz S problem, immediately after 
Was ask ne experiences, each E-group 
аз possible, to count from 1-50 as rapidly 
repo А The experimenter (E) timed S 
Presented, ed her “score” to her. S was then 
embers pua the two-string problem. 
this task of the control group began with 


Two-String Problem 


Ty 
apart f 3-foot Strings were hung 8 feet 
the foo 9, а light fixture 8/6 feet above 
the Pw. S's task was to find a way to hold 
Strings at the same time. The 


soluti А 
Оп required that 5 tie an object (switch 


or re] 
swinging to one of the strings, set it 
Catch the Brab the other string, and then 


© weighted stri i 
nat veighted string on the upswing. 
tion, cae ee front of S, in a constant posi- 

e switch and the relay. All Ss 


were able to make a choice of object and 
verbalize a solution within 6 minutes. 


RESULTS 


For purposes of analysis, the two- 
string problem was considered to be a 
one-item test. Each S who used the 
switch as a weight received a score 
of 1, and those who used the relay as 
a weight received a score of 0. Conse- 
quently, the lower the group mean 
score was, the more pronounced the 
functional-fixedness effect (ie., the 
greater the tendency to choose the re- 
lay over the switch as a weight in the 
two-string problem). 

It is recognized that a dichotomous 
variable does not allow one to meet 
certain assumptions underlying the 
analysis of variance model. However, 
the study by Boneau (1960) suggests 
that this may not have seriously dis- 
torted the analysis of variance re- 
sults. Although chi-square is more 
appropriate, limited degrees of free- 
dom do not allow the multiple com- 
parisons desired in the present study 
(Lewis & Burke, 1949). Even so, the 
data were analyzed by both pro- 
cedures with parallel results. The one 
statistical test most damaging to 
Flavell’s hypothesis, the difference 
among E», Es, Es and C, was sig- 
nificant (x? = 9.91, df = 3,р < 02). 

A simple analysis of variance 
yielded an F ratio (3.12, df — 4/110) 
significant at less than the .02 level. 
Thus, there were reliable differences 
among the five groups. To specifically 
locate these differences, Duncan's new 
multiple range test was applied to the 
data (Edwards, 1960). The results of 
the multiple range test are presented 
in Table 2. The significant mean 
difference between the E, and C 
group is in agreement with previous 
studies and indicates that experienc- 
ing the switch in its usual function in- 
fluenced the frequency with which it 
was used as a weight in the two- 
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TABLE 2 
RESULTS or THE Duncan MULTIPLE RANGE 
TEST AND NUMBER or Ss IN EAcu Group 
CHOOSING THE SWITCH AND THE RELAY 


Group 
No. Ss choosing 
Ea Ei Es E: € 
Switch 4 6 10 12 14 
Relay 19 17 13 11 9 


M .17 .26 .43 .52 .61 


Note.—Any two Ms underscored by the 
same line are not significantly different; any 
two Ms not underscored by the same line 
are significantly different (p = .05). 


string problem (3? = 5.66, df = 1, 
p < .02). This particular comparison 
was made to demonstrate that the 
method of assessing functional fixed- 
ness in the present study is com- 
parable in results to those obtained 
by other investigators. 

_ With respect to Flavell’s hypothe- 
sis, groups E», Ез, Ey, and C should 
not have differed from each other. 
However, since Group Es differed 
from Es and C, it cannot be con- 
cluded that the number of unusual 
uses as such reduced the functional- 
fixedness effect in Flavell’s study; as 
indicated by Group Es, if S experi- 
enced the usual function first, func- 
tional fixedness was manifested even 
though four unusual uses were subse- 
quently experienced. 


Discussion 


As a positive phenomenon, func- 
tional fixedness reflects the ways in 
which the perceived properties or 
meanings of an object shift their em- 
phasis depending upon previous ex- 
periences with and current demands 
of the object. The functional-fixed- 
ness experiment is a way of studying 
the ease or difficulty with which the 
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properties of an object show them- 
selves; or what is the same, it is & 
way of studying the object’s psycho- 
logical strueture—the behavioral pos- 
sibilities to which its aspects refer. 
The results of the present study 
indicated that previous experiences 
with an object did not have an addi- 
tive effect which operated irrespective 
of the total structure of these experi- 
ences. When S experiences the switch 
aspect of the switch, he has the ex- 
perience that the object was prob- 
ably manufactured for this purpose. 
This experience has a positive and Я 
negative aspect. Positively, this ex- 
perience renders the object less am- 
biguous—S discovers what it 19: 
Negatively, this reduction of am- 
biguity narrows the background of 
possible meanings such that other 
functions become more remote aP! 
less discoverable. On the other hand, 
when S experiences the switch in ? 
nonspecialized function (some mino! 
or less-essential aspect), he realizes 
that this is not the function for which 
the object was manufactured, eve? 
though it realizes his purposes at 
hand. Hence, the background of dis- 
coverable meanings remains ope? 
The meaning or perception of the ob- 
ject is not compellingly polarized 1 
any one direction—the object remain" 
ambiguous. But these experiences 47° 
modified by the total structure of ox 
periences of which they are а РАТ” 
Thus, when the essential aspect of the 
switch is emphasized first (E); the 
perceived properties of the object 27 
polarized in the direction of this funt" 
tion (meaning), and subsequent ех 
periences with some of its mine 
aspects do not disturb the emphasi7e 4 
properties—the discoverable meaning. 
remain relatively hidden. When ie 
minor aspects of the object are fir 
emphasized (Ез, E4), the backgroU. 
of discoverable meanings (the m" 
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A cede of the object) is what 
B one api and this emphasis is 
nol even sr Be switch 
and to tee eventually experienced, 
ime ono MU same SOINS de is 
the bru Ss have not experienced 
ton "E in its specialized. fune- 
E, , and o. differences among Es, 
tion ie of the switch func- 
Bybee bie from the total pattern of 
leading ш experiences would be mis- 
Dn бап E es the switeh func- 
ame” ex зе said to provide the 
tive of ; xperience to all Ss irrespec- 
mental membership in a given experi- 
ever dull it is apparent, how- 
a the : the meaning or significance 
pendin witeh experience varied de- 
пели on the total structure of ex- 
part. e to which it contributed аза 
Varied оа is precisely what 
to another, one experimental group 
titan above descriptive inter- 
Contempor is not in the style | of 
MN ар American theoretical 
Pretations d several alternative inter- 
Ks Sch mus to be mentioned. 
Masterful f, (1960) has shown in а 
ness can b ashion, functional fixed- 
tive of : e viewed from the perspec- 
tion doe ransfer, However, his posi- 
Present - not seem to account for the 
he UM, Specifically, in relat- 
Dess А ee functional-fixed- 
Schulz 1 igm to his transfer model, 
ach 960, p. 70) says: 
With qp, е stimulus “relay” is associated 
18 more ‘Gees “electrical appliance” it 
eight” a cult to attach a new response 
s. pendulum bob” to this stimu- 


Since ; 
ce, 
Wa, in the present study the switch 


s : 
al GM in its electrical function in 
tial res ape groups, the differen- 
Sehu Ulis obtained indicate that 
Althou [оон is аф best incomplete. 

£^ transfer theory does not 
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seem to clarify the obtained results 
it does have the advantage of specify- 
ing experimental variables to which 
functional fixedness can be related. 
Kendler (1963) discussed func- 
tional fixedness from the perspective 
of habit competition. The habit to 
use the switch in its electrical func- 
tion competes with the habit to use 
it as a weight. By using the switch 
as a switch, the “wrong” habit is 
strengthened; thus it is easier to use 
the relay as a weight in the sub- 
sequent problem situation. Kendler 
(1963, p. 366) further states that: 
en a habit to respond in a particu- 
iven object has become very 
t, if not impossible for 
another way. 


.. Whi 
lar way to a E 
strong, it is difficul 
us to respond to it in 
This position also fails to account 
for the differential results obtained. 
Further, no S had difficulty in using 
the switch in each of the four unusual 
ways even after experiencing the 
switch as a switch (Е). 

Another possible interpretation is 
provided by Judson and Cofer (1956). 
These authors present evidence for a 
primacy effect in reasoning which 
they interpret as indicating the avail- 
ability of a habit family is modified 
depending on the first unambiguous 
stimulus encountered. If stimulus 18 
broadened to include function oF 
meaning in the sense of the present 
study, a crude parallel may be drawn 
between their study and the present 
one. Further, if the switch function 
is equated with Cofer’s unambiguous 
stimulus, the parallel is a little closer. 


However, since all experimental 
groups experienced the switch, no 
nctional fixed- 


1 degree of fu 
ness among groups would be expected. 
If, on the other hand, the experience 
af an unusual function first induces 
the habit family of “unusual uses,” 
whereas the experience of the switch 
function first induces the habit family 


differentia 
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of "electrical uses," the data of the 
present study can be accounted for, 
but functional fixedness loses Е 
meaning. That is, if S uses the switch 
in the two-string situation, it is be- 
cause the unusual-uses habit has be- 
come fixated, while if S chooses the 
relay, it is because the electrical-uses 
habit has become fixated; both choices 
reflect fixedness of different types 
(usual and unusual) of functions. 
Even if this perspective were com- 
pelling, the fact that the primacy 
effect is itself in need of explana- 
tion reduces it to little more than a 
descriptive rather than explanatory 
position. 

The present study called into ques- 
tion the validity of Flavell’s hypothe- 
sis, and it emphasized the importance 
of the structure of previous experi- 
ences in terms of the significance 
such experiences may have in subse- 
quent problem situations. If the 
phenomenological interpretation given 
seems to be directly in the tradition 
of gestalt psychology, it is because 
phenomenology has radicalized many 
of the fundamental ideas of this psy- 
chology (see, e.g, Gurwitsch, 1964; 
Merleau-Ponty, 1962, 1963). 
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ar ying шом іп ability is made difficult by (a) unreliability of 
anil (o s (b) inequality of units in which measures are expressed, 
um PERS MUNSIM M in what is measured by different instruments 
Har. T erent levels. These problems are illustrated in analyses of 
iud e Growth Study data, where 2 intelligence tests were given to 
EL. 593 pupils each yr. for 6 yrs. The maximum reliability of 
hy ectual growth appears to be about 25 or .30, An unbiased estimate 
is he obtained correlation of status with gain appears about .10, but 
Ке o e has apparently been reduced by inequalities in the 
iid „0 measure between (and within) instruments, and by the fact 
at different tests measure somewhat different functions. 


4 . 
Mara psychologists began in- 
have ing individual differences, they 
the elon interested in determining 
tellecty T to which differences in in- 
and pr «i growth patterns are orderly 
a E а from something that 
арар ON about the individual 
tent the early in life, to what ex- 
Contro] Se differences are under the 
ble rice ascertainable and describa- 
What eg conditions, and to 
tatie and ent they are inherently er- 
Muc unpredictable. 

tion is of the use of tests in educa- 
ility sei on the premise of sta- 
Will «oL hat the bright child at Age 8 
Age ту ue to be a bright child at 
evidence, 12. And there is abundant 
the case, at within limits, this is 
~that j Pub this is stability of status 
Sarily the 18 still bright, not neces- 
*apidly p he has continued to grow 
бау ag 1 S Anderson pointed out as 
of intel] 939, a good deal of stability 
sounted omal status can be ac- 
ibility ig by a model that says that 
iy а an а later date consists of abil- 
earlier date plus a growth 

Wut 
ferent vena Was presented іп a slightly 
yum as a presidential address 
i can Psychological Associa- 


(bj; Divi 
Vision 15) t of Educational Psychology 
» Chicago, 1965. 
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increment that is completely uncor- 
related with the earlier level of abil- 
ity. Bloom (1964) has developed this 
theme at some length in his recent 
book, Stability and Change in Human 
Characteristics. 

What are the substantive findings 
with respect to increments in intel- 
lectual ability? Do these increments 
represent an extrapolation of growth 
trends established at an early age? 
Or do they represent the addition of a 
random increment—random in the 
sense that it is completely uncor- 
related with intellectual status at the 
beginning of the period in question? 

Unfortunately, it is very hard to 
say. The study of change is one of the 
trickiest areas in which to work if 
one must use the fallible instruments 
that are the psychometrician's tools. 
The questions asked are simple ones: 
To what extent are the children who 
show rapid intellectual growth over 
one period of time the same children 
who had a prior history of rapid in- 
tellectual growth? To what extent 
are individual differences in rate of 
intellectual growth over an earlier 
period of time related to individual 
differences in growth rate over some 
later period? To simple questions we 
would like simple answers, and that 
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might be possible if three conditions 
could be met: having completely 
error-free measures, having measures 
expressed in meaningfully equal units, 
and having at all ages measures re- 
ferring to jdentically the same at- 
tribute of the individual. Then gains 
could be correlated with initial status, 
and it would be possible to see to what 
extent initial status is related to gain 
or gain over one period is related to 
gain over another period. In the best 
of physieal measures—that is, meas- 
ures of height or weight—these con- 
ditions are approached, but when 
we deal with mental tests, we are dis- 
tressingly far away from meeting the 
specifications. 

'Three problem areas will be ex- 
amined in turn—reliability of meas- 
urement, equality of units, and com- 
parability of function—to see how 
serious they turn out to be, and what 
the implications are for inquiry. 


RELIABILITY OF MEASUREMENT 


Much of this presentation will be 
based on some reanalyses of Harvard 
Growth Study data. In terms of ex- 
tensiveness and continuity of test- 
ing these data are in many respects 
unique, but they were gathered when 
computing facilities were neolithic by 
present-day standards, and it is the 
author's impression that they have 
never been fully analyzed. 

In the Harvard Growth Study that 
Dearborn initiated in the early 1920's, 
over 3,000 pupils in publie schools 
near Boston were studied from the 
time they entered the first grade un- 
til they left the school system—either 
by dropping out, moving away, or 
completing the twelfth grade (Dear- 
born, Rothney, & Shuttleworth, 1938). 
A great many different anatomical 
and physical measures were obtained, 
but interest here is limited to the 
group-intelligence-test results, which 
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were obtained by first one and then 
two different tests each year through- 
out the duration of the experiment. 
The present analysis is limited to 
the 593 cases out of the total of some 
3,000 for whom complete records were 
available on one intelligence test at 
Age 8, and on two tests at each ш: 
from 9 through 14. These are, 5 
course, group tests—the tests tha 
were available for use in the 1920’s— 
with whatever limitations attach to 
them. In all, 13 test scores ba 
in mental-age units were availab д 
spread over a б-уеаг period. T 
matrix of correlations was compute. 
not only among the 13 status BRE 
but also among all the differens. 
scores generated by taking the pom 
in pairs. Thus the correlation of ел 
9, with Test 10, and the correlati Е 
of 9, with 10, is available, toget ^ 
with the correlation of the differen 
between 9, and 10,—the d 
measured by this one pair of tes m 
with the difference between 92 d 
10.—the gain as measured by 
other pair of tests. 1 
With these data in hand, the nos 
lems arising from errors of un 10° 
ment will first be examined. ERES, the 
gists are all quite conscious O al 
fact that intelligence tests all 
short of being perfectly оте” 
When it is thought of at all, а Dinos 
lation coefficient of .85 or .90, OF "m 
optimistically, .95, is on ee p 
age 
"n 
or grade. And psychologists poti. 


s & 
vaguely aware that change score + is 


a 
recognized that much of what 1$ mee 
ured at any one point in tim 
mon to what is measured at we М 
point in time, and that this cO gai? 
component disappears when Ж re 
scores rather than status 5007 


А ат 
used. But how much less reliable 
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in + 
m Scores? What is the residual 
To ity of a set of change scores? 
бее [opes this question empirically, 
tudy pen the Harvard Growth 
tions Бей {а a whole set of correla- 
Coverin rie two gain scores, each 
ife, b Р the same span in the child's 
rate ч each being based оп a sepa- 
бп Testa of tests. That is, gain based 
corral i» 9: and 10; can actually be 

i Sc ed with gain based on 9» and 
Span at i of these gains cover only a 
edi year—9-10, 10-11, 11-12. 
en er 2 years, some 3, some 4, 
Bains ove pant of correlations is for 
age Gomes 5 years. The actual aver- 
interya] ations are as follows: 1-year 
“Year Fou. 2-year interval, .240; 
" erval, .266; 4-year interval, 

д : year interval, .265. 
for ieee then, that the ceiling 
еса ey of a measure of in- 
ministra, ‚ growth based upon the ad- 
tests m of two group-intelligence 
Years E with an interval of several 
aroun, сащ them, is somewhere 
idua] ' 5 or .30. The result for indi- 
gher, p ests might be somewhat 
the da ut probably not much, since 
children * being examined involve 
Carlier tection 10 years old at the 
Ing ar dang. In any study involv- 
зе that Scores, it must be recog- 
\ Р үе datum is really fragile, 
Ih ds. ue lability that appears to lie 
‚епш from .1—3, depending 
“Search tests and the interval, and 
ated aoe 1486 be planned and evalu- 

Ae cordingly. 

К revign п repeatedly pointed out 
à © Substa; Critics of growth studies, 
ne їн. s amount of error vari- 
te. Sleadin St scores leads to especially 
hj Dts d conclusions if one at- 
pi ^ correlate gains with ini- 
Droy;4" Using the same testing 
Vide b th э кё 
and th oth the initial-status 
e а € subtrahend in the gain 
ro Teason, of course, is that 
Measurement in this com- 
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mon test appears in the status score 
with a positive sign and in the gain 
score with a negative sign, and so 
generates a spurious tendency towards 
negative correlation based on the 
reversed error component. That is, if 
status is represented by Test A, and 
gain by B minus A, a positive meas- 
urement error in A results in a falsely 
high A score, but a falsely low gain. 

The present data provide an ex- 
cellent opportunity to demonstrate 
this effect empirically and to get 
some idea of its size. It is possible to 
do this by comparing two sets of cor- 
relations between initial status and 
subsequent gain. In one set, the same 
testing is used both to define status 
and to calculate gain; in the other 
set gains are calculated from the 
second set of measures. That is, if 
status is defined by Test 91, in the 
first case gain is defined as 101 minus 
9,, while in the second case gain is 
defined as 102 minus 92. These two 
types of correlations are shown in 
Table 1. 

The average value varies somewhat 
with the time interval, but a roughly 
representative figure might be —.20 
when the same test is used to define 
both status and gain and .10 when 
different tests are used in the two 
cases. Thus, the difference between 
— 920 and .10 is the magnitude of the 
spurious effect generated by lack of 


experimental independence in the 
data and the incorporation of a com- 
mon measurement error with opposite 


sign in each variable. 
TABLE 1 


CORRELATION OF INITIAL STATUS WITH GAIN 


Based on experimentally 
Based on common test independent tests 


Intervals (in yrs.) 
H -045 


Intervals (in утз.) | _ 973 
1 ` 2 
1 25387 | 5 005 
2 = 1283 | 3 1031 
1 2104} 4 "130 
—.053 | 5 .329 
5 ! 
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The experimentally independent 
correlations just cited, for which the 
typical value is perhaps .10, provide 
the best estimate of the correct and 
unbiased correlation between initial 
status and subsequent gain. It is a 
pretty minute correlation, but con- 
sidering what has just been seen 
about the reliability of gain measures, 
its small size is not too surprising. 
A rough estimate can be made of 
what it would become if it were cor- 
rected for the unreliability of both 
status measure and gain measure. 
Using .85 and .25 as working esti- 
mates of the two reliabilities, the 
corrected correlation becomes about 
.22—still pretty unimpressive. 

In the light of the results to this 
point, two conclusions seem reason- 
able. The first is that rate of gain in 
intelligence-test score over the school 
years is in large part inherently un- 
predictable because of the extremely 
low reliability of the measures of 
individual differences in gain. The 
second conclusion, which becomes 
rather tentative because of the situa- 
tion implied by the first conclusion, is 
that the nonchance component in gain 
score is positively related, but only 
slightly positively related to initial 
status. In considerable part, the fac- 
tors that produce gains during a 
specified time span appear to be differ- 
ent from those that produced the 
level of competence exhibited at the 
beginning of the period. 


EQUALITY or Units or MEASURE 


But it will be remembered that 
three requirements were stated for 
meaningful studies of individual dif- 
ferences in mental growth. So far 
only the effects of violation of the 
first condition have been considered 
—error-free measures. What about 
equality of units and homogeneity of 
the function being measured? — — 

Whenever correlational techniques 
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are applied to psychological data, it is 
assumed that the numerals in which 
the variables are expressed represent 
equal increments in some attribute. It 
is also recognized that this assump- 
tion is usually not well supported. 
But for “rough and ready” studies 
of relationship, the violation of the 
assumption usually does not hurt 
much. However, when starting to deal 
with something as fragile as a change 
score, the violation of this basic as- 
sumption becomes a good deal more 
critical. The variance of change scores 
is in some cases no more than one- 
fourth the size of the variance O 
status scores. With two measures COn- 
tributing to the gain score, any vari- 
ance arising from irregularity 10 the 
two tests’ converted score will tend to 
bulk 8 times as large as a fraction 9 
the gain-score variance. . "m 
Furthermore, any systematic poen 
in the size of the units, such as n 
produced by a test ceiling too 10%. 
permit full expression of the abilities 
of those tested, may produce 2 sys 
tematic distortion of the relationship 
between status and gain. With 800 
ceiling effect, those initially higher 
will not be fully measured by the е 
at the later ages, and will necessari y 
show smaller increments. Thus, 1 jos 
Harvard Growth Study all test I А 
sults were presented in mental-2£ 
units. However, for the older 
of pupils—13- and 14-year-olds 
mental-age for 
brighter fraction ha 
extrapolated values, based ae 
ever type of assumption та " 
кошы eel to the test ape 
of the 1920’s. (The test manuals = 
notably terse оп the procedures wee 
to set up the scale of ments. 
equivalents.) If the extrapola 


s 


ive" , 
were made somewhat conservati ii y 


as not to overstate a level © АРЫ 
that was arbitrary and not en 
cally based on the age schem 


а а 
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in 
uA usn the effect would be to 
able stimate the scale values for 
the md at the higher ages, with 
Pupils " that gain scores for these 
PA ould be depressed. Even a 
ibat раша depression could 
nate th lally reduce or even elimi- 
eios e necessarily small correlation 
ШТ, n initial status and the highly 
So iable gain score. ` 
ata ha evidence from the present 
Problem cre on the reality of the 
from ile unit of measure comes 
andara comparison of means and 
mental deviations (expressed as 
in the hem of the two tests given 
in Table 2 à year. These are shown 
бт thee Differences in parameters 
cannot р nearly simultaneous tests 
environ: e attributed to growth or 
ü he ы impact or anything 
лере, characteristics of the tests 
es es. Thus, it is found that the 
УЫ? test means of 9-10 and 
but ee agree quite well; 
deviatio the corresponding standard 
bern of 20 and 13 months, one 
ee i the other by 50%. For the 
gree ear the standard deviations 
X5 —both are 23 months—but 
differen means are 11-5 and 10-3, à 
is а же of 14 months. At 11 there 
two Rn difference between the 
in the E and a 5-month difference 
ar, thoy p istandard deviations. Simi- 
з ү less dramatic, differences 
e units a 12, 13, and 14. Clearly, 
Quivalent the different tests are not 
same oan” One suspects that the 
ments ^j be said for different seg- 
teg, ^ 01 the score scale for a single 


The 
for gg anging standard deviation 
rent variables makes for 


quit, 

Status atic correlations between 
Veal and gain. Simple algebra re- 
Size e fact that the sign and the 
Score the correlation of Status 
Scor with gain from Score B to 


epends upon the relative 


сл 


ТАВІЕ 2 
MEANS AND STANDARD DEVIATIONS OF 
PAIRS oF TESTS GIVEN IN THE 


Same YEAR 
Test No. 1 Test No. 2 
Ss 

M SD M SD 
9-year-olds | 118.0 | 19.9 | 119.7 13.3 
10-year-olds | 137.4 | 23.0 | 123.2 | 22.7 
ll-year-olds | 153.6 | 24.4 | 148.1 | 29.4 
12-year-olds | 164.6 | 22.9 | 164.5 | 26.3 
13-year-olds | 167.7 22.0 | 176.1 | 24.7 
1i-year-olds | 178.1 | 23.9 181.4 | 23.0 


size of scrac and sn» . 'The formula is 
as follows: 


$ TacSc — ТАВ$в 
A(C-B) = Wed + asi — Sracistc A 
Sc! + 58° — 2rnc828c 


Thus a change in scale unit, resulting 
in a change in standard deviation, 
havoc upon one's results. 
n our data, in the 
wide range of values that were ob- 
tained for the correlation of status 
with gain. Though a value of .10 has 
been suggested as best representing 
the relationship, this average value is 
based on a range of correlations 
from .64 to —.43 for different specific 
inations of initial test and sub- 
sequent test to define gain. This vari- 
ation illustrates the point that the 
result obtained is spectacularly re- 


sponsive to distortions of the unit of 
he initial and 


can wreak 
This appeared, i 


comb 


measure as between t 
final test. f I 
One particularly misleading pas- 


time that is sometimes engaged in, 
even by those who should know better, 
is to study the relationship between 
an initial IQ and the change m IQ 
during some subsequent period. The 

f this procedure become 


absurdities 0 
the 1Q is thought of as a 


clearer if п 
standard score keeping the same 
mean and standard deviation at each 


aracteristic that is explicitly 


age—a ch 
tly developed tests and 


true of recen 
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approximately true of those developed 
in an earlier day. Ase 

When the standard deviations of 
two sets of scores at different points 
in time, B and С, are forced to be 
equal, and the correlation of a third 
score, À, with the difference between 
the two is studied, the sign of the 
resulting correlation depends solely on 
the relative size of the two correla- 
tions raz and rao. Thus, if it is de- 
sired to study the correlation be- 
tween IQ on Test A at Age 8 and 
change in IQ írom 9-12, the only 
way that the correlation can be posi- 
tive is for the 8-year-old test to cor- 
relate more highly with the 12-year- 
old test than with the 9-year-old test 
—a fairly improbable and unnatural 
situation, If the correlation with the 
later test is lower, correlation with 
the change score will necessarily be 
negative. 

By eliminating from the score 
scale the differences in standard devi- 
ation at different ages, that which is 
the essence of growth is eliminated— 
the greater variability of specimens 
as they mature. Imagine a group of 
adults whose heights and weights 
showed no greater standard devia- 
tions than those of newborn babies! 
And a statistical treatment that auto- 
matically excludes greater variability 
in intellect as we go from birth to 
maturity is equally absurd. Thus, 
any studies of intellectual growth that 
use as а score scale an IQ with a 
uniform standard deviation from age 
to age are, by that very fact, mean- 
ingless for any analytical study of 
the growth process. The constraint 
that has been put on the score scale 
assures distorted results. 


Unirormity or Function MEASURED 


A basic assumption, if growth is to 
be studied, is that the same function 
is being measured throughout the 


range of ages being studied. But 
when intellectual performance over а 
range of ages is studied, it is quite 
usual for the particular test to change 
as we go from an earlier to a later 
age. The change in test is likely to be 
accompanied by some change in the 
functions measured by the test, de- 
pressing the consistency between early 
and late performance, with a result- 
ing depression of the relationship be- 
tween a status measure based on Test 
A and a gain measure based on Tests 
B and C. е 
The Growth Study data permit & 
look at this problem because Ше 
were several age spans during which 
the same test was used in successive 
years. In all, six comparisons Wer? 
possible between the correlation s 
some test, A, used in one year Ms 
Test A in the following year and i 
correlation with a different test, ^ 
the following year. In the same yari 
six comparisons were possible of hae 
relations of some test, A, in а Quel 
year with Tests A and B in the А 4 
ceding year. The same-test до 
tions with a 1-уеаг interval averai 
to exactly .80, the correlations W! 
another test in the following убагын, 
75, and with another test in the рї! 
year, .73, or a pooled average O 
for the correlation between differ 
tests. we 
This means that, while idee 
tests showed 64% of common Vit) 
ance from one year to the next, W 786 
change in test the common d q 
was only 55%, or only 86% as en 
Nine percent of the total variance aoe 
14% of the stably measured vari? 
was unique to the specific test. 


ent 


Е as- 
If this change in function Peach 
ured was cumulative, so ha, 14% 


time a new test was introduce? 7. 
of what was measured by the prenne 
was replaced with new v^ take 
unique to the new test, it woul 
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ed eia Shifts before measures of 
slicks = unrelated functions were 
way à Of course, this is not the 
Test re Operate. As we go from 
DEK oi т B {о С, it is likely that 
far as os at is unique to Test C so 
in com ach В is concerned is shared 
likely og with Test A. It is un- 
Top in ata clear picture of further 
shift Рв with a successive 
ed be found. Thus, in the 

correlat, data, Dearborn C at Age 11 
ge 12 es .808 with Dearborn C at 

lle o yi only .760 with Haggerty 

еа ay ge 12; .693 with Haggerty 

oman qi AER 13, bub 685 with 

with Ter roup at Age 13, and .736 
is no mE ey Group at Age 14. There 
ove ES ence of further drop as we 

case th to the third test, in this 

м е Terman Group Test. 

80 to a noton in correlation from 
Comparin that has just been seen in 
oes Sof Same with different tests 
Опсе a „Seem like very much, but 
or imer reference to the formula 
Score “ae between one status 
Shows +} another change measure 
even a nat anything that introduces 
the der extraneous reduction in 
Status a. of the correlation between 
the thee the terminal measure of 
even е Sequence can extinguish or 
tion bety erse the sign of the correla- 
ough . status and change. Thus, 

different he lower correlation between 
tan ds tests looks pretty unimpor- 
Tobug ^, Ve look at the reasonably 
Signifion Scores, it becomes very 
the "rr When we are dealing with 
from do agile change scores. À drop 
to .74 in intercorrelation 
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could reduce the correlation between 
status and gain from .20 to .10, cutting 
it in half, or from .10 to 0, existin- 
guishing it completely. 


CONCLUSION 


Thus it has been seen that change 
scores are very sensitive creatures, 
very responsive to changes in score 
scale and to changes in the func- 
tion being measured. So perhaps we 
need to modify the earlier conclusion 
that seemed appropriate from the 
Harvard Growth Study data—the 
conclusion that only a small fraction 
of the gain in later competence can 
be thought of as arising from a con- 
tinuation of early growth trends. Per- 
haps it must be concluded that, unless 
and until scales are developed that 
are truly homogeneous in the func- 
tions that they tap at all levels and 
are truly expressed in equal units, we 
will have to forego serious attempts 
to give a quantitative answer to the 
simple but tantalizing question: To 
what extent will the children who 
have grown rapidly in intellect up 
to the present moment continue to 
grow rapidly in the future? 
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CONDITIONS OF PREQUESTIONING AND RETENTION 
OF MEANINGFUL MATERIAL 


D. E. BERLYNE 
University of Toronto 


Exposure to 28 quotations and subsequently to names represented as 
possible authors was used with 8 classes of high school students, num- 
bering 225 in all. Retention of the names was significantly improved 
by requiring Ss to record guesses when 1st exposed to the quotations 
and by presenting all the authors’ names after all the quotations 
rather than each name immediately after the corresponding quota- 
tion. Expressed curiosity about the alleged authors was significantly 
correlated with retention scores and retention scores with intelligence- 
test scores, but curiosity was uncorrelated with intelligence. Theo- 
retical implications are discussed. 


An experiment carried out some 
years ago (Berlyne, 19542) showed re- 
tention of factual statements to be 
enhanced by prior exposure to ques- 
tions that the statements answered. 
'This finding had been predicted from 
some hypotheses about epistemic curi- 
osity (Berlyne, 1954b, 1960, 1963, 
1965a, 1965b), conceived as a state of 
heightened drive or arousal that is 
relieved by acquisition of knowledge. 
It had been postulated that the ques- 
tions would generate epistemic curi- 
osity, which would be relieved after 
subsequent exposure to the corre- 
sponding statements and internal re- 
hearsal of them, and that reinforce- 
ment from the consequent curiosity 
reduction would increase the likeli- 
hood of recall when the questions 
were presented again during the test 
phase. In addition, it was found that 
the subjects (Ss) exposed to the fore- 
questionnaire expressed more “ex- 


1 This investigation was supported by Re- 
search Grant MH-06324 from the National 
Institute of Mental Health, United States 
Public Health Service. The author wishes 
to acknowledge the able collaboration of 
M. A. Craw and J. L. Lewis, who acted as 
experimenters. He would also like to thank 
A. B. Patterson, principal of the David and 
Mary Thomson Collegiate Institute, Scar- 
borough, Ontario, who kindly arranged for 
his students to serve as subjects. 


tended curiosity” than control Ss; 
that is, when asked at the end of the 
experiment to indicate which of the 
animals figuring in the material they 
would like to know more about, they 
checked off more animals. 

Rothkopf (1965) has embarked 00 
a rather similar line of research. < 
has ascertained that the presentation 
of questions at various points 1n t * 
learning process can not only increas 
retention of facts answering the 9065 
tions, but also produce a generalize? 
improvement manifested in the p^ 
tention of other facts. Rothkopf, ! 
interpreting his findings, uses а rath? 
different theoretical approach, holding 
that questions give rise to “inspect”? 
behavior” or “mathemagenie 
sponses,” which facilitate mean! 
learning. " 

The experiment to be reported Зас 
was meant to carry further IP "^ 
direetions the study of the effects es 
prequestioning on retention ap" 0 
extended curiosity. First, effects ns 
three variables relating to conditi he 
of prequestioning were studied. i 
variables were (a) whether OT putt 
has to guess the answer to each n 2 
tion, (b) time elapsing between ©. 
secutive questions, (c) whether, and 
exposed to all the questions firs ; the 
then to all the answers or whethe 


eful 
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ена 5 answer is given after 
LS F4 estion. Second, the experiment 
7 to probe effects оп re- 
inel н two characteristics of mul- 
Е яме ле baee that had been 
n previous experiment 
П уы! $ 1962) to affect the degree 
ree of n е uncertainty and the de- 
ges eported euriosity. These were 
Stave E ee of alternative an- 
и (b) the degree to which 
Ton ез to select different answers 
Strength. 


METHOD 


Stimulus Material 


The sti 
ee material, identical with that 
Jerlyne 1962 experiment just mentioned 
tions from 2), consisted of 28 brief quota- 
Merican hk prominent in English or 
200 years E d or literature of the last 
Braphed in rua quotations were mimeo- 
10n Were th ooklet, and under each quota- 
ostensibly he names of two or three men, 
name of ple authors. Actually, the 
em, so th correct, author was not among 
knowledge at possible effects of previous 
ach name could be excluded. Coupled with 
e mumbar cs a number, purporting to be 
100 who had of high school teachers out of 
Tue autho eed it to be the name of the 
three nan f the 28 quotations, seven had 
tion of fictit and a highly uneven distribu- 
two Name: itious teachers’ guesses, seven had 
had уге: апа an even distribution, seven 
Seven had names and an even distribution, 
i two names and an uneven dis- 


trib 
ution, Т 
ere rando hese four classes of questions 
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two versions. The quotations associated with 
three names in one version were associated 
with two names in the other and vice versa. 
The quotations associated with uneven dis- 
tributions in one version were associated 
with even distributions in the other, and vice 
versa. 

„Опе of the possible authors associated 
with each quotation was arbitrarily chosen 
to be presented as the “correct” author. At 
the end of the booklet, there was an answer 
sheet, bearing lines numbered from 1-28, 
followed by a sheet for the Extended Curi- 
osity Test, bearing the names of the 28 “cor- 
rect” authors. 


Subjects and Experimental Design 


Ss, 225 in all, consisted of four Grade 12 


and four Grade 11 classes. The school Ss 
attended practiced “streaming” or “homo- 
geneous grouping,” so that the A, B, C, and 
D “forms” in each grade represented differ- 
ent levels of ability. 

A 2 x 2 X 2 between-Ss design was used, 
the three dimensions being (a) Guess versus 
No-Guess instructions, (b) Immediate versus 
Delayed provision of answers, and (c) a 10- 
second versus а 20-second interval between 
consecutive questions. The assignment of 
combinations of treatments to classes is 
shown in Table 1. It will be seen that the 
difference between ability levels is con- 
founded with the double interactions and the 
difference between grades is confounded with 


the triple interaction. 


Procedure 

After the experimenter (E) had been in- 
troduced and Ss had filled out biographical 
information on t oklet, 


they were told 
being asked to: 


he cover of the bo! 


-day to partici- 


mly int н 
hen erspersed. 
Proxi the booklets i You are 
m were given to Ss, ap- | Dare 
“ely equal numbers н each of pate in а research project that 18 being 
TABLE 1 
ALLOTMENT OF TREATMENTS 
Grades Forms 
A B с р 
No guess 
No guess Guess Guess | 1 
H о ана Delayed Immediate Dany ed 
10 sec. 10 sec. 20 sec. 20 sec. 
ё uess No guess Guess _ 
i: | e ud Ped dun Delayed Immediate 
20 sec 20 вес. 10 sec. 10 sec. 
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cted by the Psychology Department 
ps University of Toronto. It concerns 
ways to improve teaching methods. This 
experiment is concerned with ability to 
recognize authors of quotations. As you 
know, experts can often tell who wrote 
something from what they know about the 
background of various authors or from the 
style of the quotation itself. But at times 
even an expert will be surprised to find 
out who the true author is. 

Before you open your booklets, let me 
tell you a little bit about this project: ~ 

Some time ago we presented 100 English 
and history teachers with a number of 
quotations. Two or three possible authors 
were listed after each quotation, and the 
teachers were required to choose which 
person they thought was the correct au- 
thor. 

You people are going to be shown some 
of the quotations that were given to the 
teachers, together with the names of two 
or three possible authors for each. You 
will also be given information about the 
teachers’ choices. Printed opposite the 
name of each possible author will be the 
number of teachers who chose it as the 
true author. 

The procedure to be followed was then 
described. 

E read aloud each quotation twice while 
Ss were required to follow it on the booklet. 
There was then a pause, during which Ss 
read the names of the possible authors and 
the numbers of teachers’ guesses from the 
booklet. In Guess groups, they were also to 
underline the name they thought most likely 
to be that of the true author, but this in- 
struction was omitted for the No-Guess 
groups. In Delayed groups, all the quotations 
were treated in this way, and then Ss turned 
back to the first page and followed while E 
read aloud the quotations again in the same 
order, stating the name of the “correct” au- 
thor after each one. In Immediate groups, E 
read the quotation a third time, followed by 
the name of the "correct" author, immedi- 
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ately aíter the pause and before going on to 
the next quotation. In 20-second groups, the 
interval between successive quotations in the 
Delayed condition or between exposure to a 
quotation and hearing it read а third 
with the name of the “correct” author in the 
Immediate condition was emen and in 
he 10-second groups it was 10 seconds. 

' po last deorrect” author had been 
given, Ss heard E read the quotations, n = 
different randomized order, a fourth шш 

and had to attempt to guess the name of t * 
"correct" author for each one, writing ! 

down on the appropriate numbered iem D 
an answer sheet. Finally, Ss were require Я 
turn to the Extended Curiosity Test yen 
and to check the names of the authors tha 

they would like to know more about. 


RESULTS 


Retention Scores 


The mean Retention scores for ue 
experimental conditions are Bm. 
Table 2. These scores were subje¢ re 
to analysis of variance and also ii- 
analysis of covariance with inte 
gence-test scores, derived from sd 
Dominion Learning Capacity Test A 
concomitant variable. Data сс 
seven Ss, for whom intelligence" ^" 
scores were not available, were «d 
cluded from the analysis of cov? 
ance. m 

The within-group correlation Jli- 
tween Retention scores and inte 5 
gence-test scores was .22, bir ой 
significant at the .01 level. The Чу 
layed condition produced signifi ste 
better retention than the Immec 
condition: F(1, 200) = 6.05, Р к 
in the analysis of covariance, 


TABLE 2 
MEAN RETENTION AND EXTENDED-CURIOSITY SCORES (WITH STANDARD 


DEVIATIONS IN PARENTHESES) 
Mean scores Guess No guess Immediate Delayed 10 sec. nee 
9.2 
Я 9.5 8.0 7.0 10.4 8.5 71) 
Retention as) | em | &1 | 6m | @т® | Фр 
- 7.1 6.3 6.6 6.8 $ 3.72) 
ie ОЕР (3.55) | (8.88) | (366) | @.54) | G4 | € 
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F 

al 217) = 661, p « .05 in the 
ae of variance. The Guess con- 
Be produced significantly better 
tion: P than the No-Guess condi- 
oe ; 200) = 30.01, p « .001 in 
Р(1 21 a ysis of covariance and 
analysi ) = 333, р < .001 in the 
бп € of variance. Mean Reten- 
age were higher with 20-second 
* os 5 than with 10-second inter- 

; but not significantly so. 


E 
"ttended-Curiosity Soas 


T 
йеп id 2 shows the mean for Ex- 
ane б es scores (the data for 
in all s lost). The differences are, 
B Ret es, in the same direction as 
ilar aon scores, but analysis of 
Simiicun, ows none of them to be 
D 
With ended-Curiosity scores had 
їшї КҮР correlations of .10 (not 
Scores am x with ^ intelligence-test 
ention в 22 (p « 01) with Re- 
ion of pos The partial correla- 
With Ret, xtended-Curiosity scores 
ЫШ Saeed scores, controlling for 
P < 01), est scores, came to .2 


S 
tention te comparisons. When Re- 
Mine 200 Паша with three 
With thos e answers were compared 
: when for items with two answers 
distributi those for items with even 
Suesseg ons of purported teachers’ 
items wu compared with those for 
Means wy, uneven distributions, the 
There сн all very close together. 
difference therefore no significant 


If a Discussion 

tetention accept the hypotheses that 
quriosit.. increases with epistemic 
depends and that epistemic curiosity 
е Periorit; n conceptual conflict, the 
Condition Of retention in the Guess 
Ag that can be explained by assum- 

an instruction to guess in- 
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creases the absolute strengths of all 
the complexity response tendencies 
and thus exacerbates competition 
among them. The more pronounced 
difference found between the Delayed 
and Immediate conditions is more 
surprising and harder to interpret. In 
the Delayed condition, there is a 
greater time interval before initial 
exposure to a particular question and 
subsequent exposure to jts answer. 
There is also material intervening be- 
tween the two events, in the form of 
questions coming later in the series 
and answers to questions coming 
earlier, whereas, in the Immediate 
condition, there is none. It is possible 
that there is some kind of ‘incuba- 
tion” effect: Epistemic curiosity may 
rise progressively the longer S has to 
wait for the curiosity-relieving infor- 
mation, so that the later the informa- 
tion is received the more curiosity it 
will reduce. Presumably, if this pro- 
gressive increase occurs, it will de- 
velop only for a certain time and 
then give way to а decline. There 
may, however, be other explanations. 
Research devoted to effects of vary- 
ing time intervals between exposure to 
questions and receipt of answers and 
of varying quantity and nature of 
intervening experiences would seem to 

be called for. | 
The significant positive correlation 
between Extended Curiosity and Re- 
tention could be explained in either 
of two ways. One possibility is that 
higher extended curiosity is produced 
by better retention, because the ex- 
perience of recalling answers to ques- 
tions with relative ease makes one 
favorably disposed to the subject 
matter or reinforces epistemic (knowl- 
edge-seeking) behavior directed to- 
ward the subject matter. If this ex- 
valid, one would expect 


planation were Val" ; 
any factor that increases retention 


to increase extended curiosity, and 
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these factors would include the De- 
layed and Guess treatments. They 
would also include intelligence, which 
was positively correlated with Teten- 
tion. However, the differences in Ex- 
tended Curiosity due to these treat- 
ments, while in the same direction 
as Retention scores, were not signifi- 
cant. Nor was the correlation between 
extended curiosity and intelligence. 

The alternative explanation is that 
retention depends on at least two 
distinct factors, among others, one be- 
ing intelligence and the other being 
some kind of motivational disposition 
(cf. the individual differences in curi- 
osity among school children studied by 
Maw & Maw, 1961). Extended curi- 
osity would then reflect the latter but 
not the former. The significant par- 
tial correlation between Extended 
Curiosity and Retention, with intelli- 
gence kept constant, fits this explana- 
tion. 

The absence of significant within- 
Ss differences in Retention scores due 
to the two determinants of subjective 
uncertainty fails to confirm indica- 
tions from earlier work that retention 
increases with reported epistemic curi- 
osity. Any effects of these factors are 
likely to have been weak in compari- 


son with those of other factors (for 
example, content of questions) so that 
more sensitive tests of them will be 
required. 
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REVERSALS IN CHILDREN AS A FUNCTION OF 
MIDLINE BODY ORIENTATION" 


ROBERT W. ZASLOW 
San Jose State College 


The hypothesis that fixed arm movements with respect to the body 


midline are important fi 
evaluated with 197 norma! 


their teachers as having reversal pro! 
and letters, A crossover procedure, in 
tion of S was altered by forcing S to wri 
body midline, was successful in correc 
mal reversal Ss and 79% of the brain- 
a control group of 30 normal rever: 
sals in the normal writing position, 


actors in written reversals in children was 
land 11 cerebral palsied children selected by 


blems with respect to numbers 
which the normal writing posi- 
te on the opposite side of the 
ting reversals in 63% of the nor- 
damaged Ss on the 1st trial. In 


sal Ss, instructed to correct rever- 
only 13% corrected (p < 1001). 


SS 


_ reversal of letters and num- 
in E constant instructional con- 
childre the teachers of school-age 
station, Research involving the ori- 
that the of figures in space indicates 
ure is he proper orientation of a fig- 
Youn, секир not appreciated by 

ind potis until 5 years of age 
rather } 941). As a consequence, the 
etters high frequency of reversals of 
Second and numbers in the first and 
to be es is generally considered 
ever, if e pen of immaturity. How- 
gre ui ie a be emphasized that a 
Not see any reversals in children do 
ion. lie to reflect error in orienta- 
een од of other theories have 
chi] ur us to explain reversals in 
Dersist qu pec reversals which 
mary rough and beyond the pri- 
are Nod Among these theories 
orced t edness, in which а child is 
hanq ir d write with a nondominant 
Cause of emonstrates a confusion be- 
Travis a lack of hand dominance 
allied 4 & Johnson, 1934). Closely 

at кер this position is the proposal 
Sult of ersals are produced as а re- 
(Travis es cerebral dominance 
M 9 Johnson, 1934). Other in- 


А 
This 
fro study was sponsored by a grant 


the 
Research mea Jose State College Faculty 


vestigations attribute reversals to 
central nervous system damage in 
those children with brain injury who 
seem to show a high incidence of re- 
versals. Perceptual and figure-ground 
disturbance are said to account for re- 
versals in brain-injured children. 
Emotional disturbance is also posed as 
a causative factor in children’s rever- 
sals. The Bender Visual Motor Gestalt 
Test indicates that reversals and rota- 
tion of design are frequently found in 
both brain-damaged and emotionally 
disturbed subjects (Koppitz, 1964; 
Pascal & Suttell, 1951). 
Of these theories only those involv- 
ing emotional disturbance, where per- 
ceptual and spatial disorganization 
may occur, and central nervous system 
damage, which also produces percep- 
tual and spatial disturbance, have 
been adequate in certain cases of re- 
versals. Immaturity may not account 
for persistent reversals after the age 
of 6, except perhaps ina small propor- 
tion of children. The writer feels that 
the greater incidence of reversals 
found in boys, usually attributed to 
immaturity, may be due to greater 
motor resistance arising from anxiety 

or from jnhibition of aggression. 
e of the body midline 


The importance [ i 
in understanding reversals in children 
has been neglected, and the writer sug- 
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gests fixed motor responses as à major 
etiological factor in persistent reversal 
cases. These motor responses appear 
to be based on primitive behavior pat- 
terns and are, to a large degree, inde- 
pendent of volitional or cognitive con- 
trol Emotional factors may reinforce 
the rigidity of these motor patterns. 

From this point of view, it follows 

that for some children reversals should 
be corrected on the motor level rather 
than on cognitive or emotional levels 
exclusively. Radler and Kephart 
(1960) refer to the body midline as a 
factor in reversals and orientation 
problems only tangentially and do not 
extract the full significance of motor 
movements in writing with respect to 
the body midline. The fact that more 
boys have persistent reversal prob- 
lems than girls and show greater 
muscular aggressiveness, thrust, ac- 
tivity, and emotional resistance sup- 
ports the position of the writer. 

The purpose of this study is to 

suggest the importance of the body- 
midline function in a method for cor- 
recting reversals in normal, retarded, 
and brain-damaged children. This 
method follows from the above theo- 
retical considerations about hand 
motor patterns with respect to writ- 
ing movements to and away from 
the vertical midline body axis, and 
the suggestion that certain primitive 
motor patterns become fixed with re- 
spect to the body midline and may 
persist independently of the child’s 
intention to produce a correctly orien- 
tated figure. Thus, certain motor pat- 
terns may be responsible for the 
production of persistent reversals that 
are resistant to change by the usual 
instructional methods. 

In experimenting with a number of 
reversal cases some time ago, the 
investigator observed that often a 
letter or number that had been re- 
versed with either hand was imme- 


diately corrected if the writing hand 
was positioned so that it crossed the 
body midline. In other words, if the 
right hand produced a reversal in 
normal writing position on the right 
side of the body, the reversal be- 
came corrected if the right hand was 
placed on the left side of the body 
midline without moving the body. 
This oceurs because the motor pattern 
now is a mirror image of the reversal 
in the normal position. It is self-evi- 
dent that a motor pattern going away 
from the midline in the normal writ- 
ing position will become reversed 
and go toward the midline when the 
writing hand is positioned on the 
opposite side of the midline. . 

This technique was used in ? 
reversal study involving first-, s¢¢- 
ond-, and third-grade students, 28 
well as a small number of bram- 
damaged children. These children 
were evaluated with respect to pos 
sible correction of reversals using 
the midline-crossover procedure. 


METHOD 


Subjects 


Total subjects were 208 children, all eval- 
uated using the crossover procedure, w 
were selected by their teachers as having 
reversal problems with respect to numbers 
and letters. The original experimental group 
consisted of 178 children (167 normal 2? 
11 ungraded cerebral-palsy children); il- 
control group consisted of 30 normal chi 
dren. In the original experimental group: 


52% were age 6, 38% were age bs 
cerebral-palsy children accounted for 


higher age range) ; 64% were from Grade - 
27% from Grade 2 (the cerebral-palsy en ^ 
dren were ungraded); 61% were boys, 
were girls. Of the original experimer to 
group of 178, only 80 (45%) were foun? gj- 
actually have reversal problems in ап НА 
vidual testing situation, leaving а oup 
of 80 subjects in the experimental gr a 
(including the cerebral-palsy EI." a 1 
the control group consisting © ” 
subjects. Of the 80 subjects in the final ех 
perimental group, 53% were age 6, 5% 

age 7; 61% were from Grade 1, 29 
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G А 

QN e were boys, 40% were girls. In 
age 7; 59% were boys, 41% were gira (eee 
Table’ ere boys, 41% were girls (see 


Procedure 


Ea í р 
йер „Жав given a large sheet of 
ine re: Sd divided down the center by a 
body presenting the midline of the child's 
numbers ES i was requested to write the 
to present etters which had been found 
and lette reversal problems. These numbers 
ing НУ to be written in а descend- 
right si de order along the midline, on the 
children the midline for right-handed 
for left-hand on the left side of the midline 
o determin ed children. This step was taken 
reversal vidi if the child would correct the 
completion p crossing the midline. Upon 
anded child the right-sided set, the right- 
e midline was told to cross his hand over 
nose lette to the left side and again write 
reversed The and numbers which he had 
Possible twi е examiner took care to prevent 
Would cha; isting action of the body which 
midline, 226° the direction of the child’s 
Utilized the id original experimental design 
controls, A experimental groups as their own 
. At a later date a separate control 


Broup o 
as an deve reversal cases was evaluated 
The Corto] Pacem on the original design. 
their тея group was instructed to repeat 
sed letters and numbers three 
Acr, Gr TABLE 1 
хркими, AND Sex or Ss IN FrNAL 
MENTAL AND CONTROL GROUPS 
A , 
Квад, ëvel Experimental group] Control group 
Ss 
No. 55 | 9% 55 | No. 5 | % Ss 
Аре Е 
6 
7 42 53 20 | 68 
8 28 35 9 | 30 
9 6 8 1 2 
10-17 : p: | ш | == 
| 8. [ — e 
Grade 
1 
2 54 | 61 | 20 | 68 
t 20 | 25 | 9 | 30 
Ngraded E === 1 2 
Sex 14 — — 
M 
# 48 | oo | 18 | 59 
32 | 40 | 12 а 


TABLE 2 


Frequency OF REVERSALS AND 
CORRECTED REVERSALS 


Reversal frequency 
Categories of Analysi: pes T 
nalysis otal 
no. Ss 
£1| 2:1 S] 3: ] o8 
E = 
% Ss in each frequency 
category 
Experimental group | 44 | 32] H| 7| 6 80 
Control group 531331131613 30 
% Ss correcting іп rela- 
tion to total num- 
ber Ss correcting 
Experimental group 49|26|14| 9| 2 51 
Control group 50 | 25 | 25 | — | — 
% Ss correcting in rela- 
tion to total num- 
ber Ss in each cate- 
gory 
Experimental group | 71 | 52 78 | 83 | 20 
Control group 12 |10110] —| — 


times while maintaining the same side of the 
normal writing position. 


RESULTS 


Experimental Group 


It was found that many of the 
children selected for the study did 
not demonstrate the reversals that 
they usually produced in a classroom 
situation. In a face-to-face individ- 
ual testing situation only 45% of 
these subjects reversed symbols. In- 
attention factors may be operative 


in the classroom and greatly reduced 
in the one-to-one testing situation. 
re was & sharp de- 


As expected, the 
crease of reversa 
and grade levels. 
reversals were fou 
this result is in agree 
to be anticipated from, e 
As Table 2 indicates, the majority 
of all subjects in the experimental 
group reverse only one or two sym- 
21). The results show 


bols (M — 
that 6096 of all the reversals produced 


by the experimental subjects were cor- 
rected by the midline method. When 


]s in the higher age 
The majority of 
nd in boys (60%) ; 
ment with, and 
the literature. 
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the total number of subjects correct- 
ing reversals by the midline method 
is considered, 63% of the subjects in 
the experimental group corrected on 
the first trial (see Table 3). : 

Results show that in the brain- 
damaged (cerebral-palsy) children, 
reversals were distributed through- 
out the various grade and age levels, 
and a larger proportion (72%) of 
this group reversed than in the total 
population. It is in this group that 
we would expect to find more of the 
fixed motor patterns that could be a 
causative factor in reversals, and it 
is also in this group that the greater 
percentage of corrections (79%) were 
shown. These results hold promise 
and should be investigated more ex- 
tensively in brain-damaged children. 

The most frequently reversed num- 
bers, in order of occurrence, were 2, 
7, 3, and 9. Only 9% reversed 6, 8% 
reversed 5, and 6% reversed 4. The 
number most frequently corrected was 
2 (25%), closely followed by 7 (19%) 
and З (17%). We may note that num- 
bers 2, 7, and 3 require an initial 
movement away from the midline so 
that it may very well be difficult for 
some subjects to initiate an arm move- 
ment away from the midline, while 
other subjects have trouble initiating 
movements toward the midline as in 
numbers 6 and 9. 

The most frequently reversed let- 
ter was Z (13% of the cases), closely 


TABLE 3 


Tora, REVERSALS AND 
CORRECTED REVERSALS 


Exper Control 
Categories of Analysis p patie 
Total no. Ss __ 80 30 
% Ss correcting rever- е m 
sals o: 
Total no. reversals | 161 60. 


% reversals corrected 


followed by b (12%) and C (10%). 
Although certain reversals were more 
common, there seemed to be a wide 
distribution of reversals among all 
letters, and the corrected letter re- 
versals also fell in a wide distribu- 
tion. The most frequently corrected 
letter was C (12%), while B, D; ©, 
Z, b, and d were each corrected by 796 
of the subjects. It may be noted that 
Z reverses its direction toward and 
away from the midline, which may 
account for the high frequency of 
reversals. In most instances reversals 
appear to result from a fixed motor 
set which initiates movements toward 
the midline. 


Control Group 


In contrast to the experimental 
group, the control group showed а 
total of only 10% correction of rever- 
sals on the first trial. The average 
number of reversed items was 2.0, ап 
the total number of children correct- 
ing reversals was 4 (13%). The ef- 
fectiveness of the crossover metho 
is indicated by the superior results of 
the experimental group in correct- 
ing reversals. The chi-square test 
showed the significance to be at the 
.001 level. 


Discussion AND CONCLUSIONS 


A suggested method for correct- 
ing reversals is as follows: Have the 
right-handed ehild eross the midline 
with the right hand and write the 
symbol which he reverses. If the 
symbol is corrected by this e 
dure, then, while the hand is sti 
crossed over the midline, the abih 
should make a row of the symbo!, 
moving toward the right and cross: 
ing the midline, so that the end 0 
the row falls into the proper ya 
position on the right side of tP 
midline. 


This technique 


midline-crossover 


WRITING REVERSALS IN CHILDREN 


tig to pe fairly successful in cor- 
in Пор агре number of reversals 
dics © nal and brain-damaged chil- 
quite n the first trial. While it is 
"nu provable that reversals may 
nidie rom several factors, this tech- 
correcti appears to be effective in 
dictio ing a core factor in the pro- 
Minds of reversals—namely, move- 
he Ae Pi away from the midline. 
inn я, ortance of the midline func- 
clarify nay be elaborated upon to 
pe le aid in certain instruction 
It has b that the teacher encounters. 
certain een noted in this study that 
iin children are capable of pro- 
only £ T вра numbers and letters 
the botto ey begin to make them from 
his pr m rather than from the top. 
tain fix M eama compensates for cer- 
бё тү motor patterns which pro- 
guided m if the child is fully 
Teaser er, PE example, if à 
e to ed at the bottom rather than 
ment » a tendency to make a move- 
produ, mara the midline wil not 
Cur if it А reversal that would oc- 
manner, Were written in the normal 
T : 
n " рейин (cerebral- 
Sals udi wa produced more rever- 
of более, tained a higher percentage 
mal 01160401 reversals than the nor- 
Pected Meis This result is to be ex- 
Patterns ecause of the fixed motor 
sult. Thi resulting from cortical in- 
Motor : Study suggests that fixed 
e im play a major role in 
Palsied = reversals of cerebral 
ances ma : dren; perceptual disturb- 
Yevers x not be the major cause 
Possible к in these subjects. It is 
estalt € rotations on the Bender 
ese subj est, so characteristic of 
Proper 1.1805 may result from im- 
Sheet ,Dlacement of the protocol 
le de respect to the midline of 
designs lect. The subject may orient 
According to his own midline 
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as projected on the sheet of paper in- 
stead of the actual postu Uu hs 
paper. This may occur when the 
paper is placed at an angle to the 
body midline. 

Left handedness complicates the 
entire problem of midline orientation 
and may account for the greater num- 
ber of reversals found in left-handed 
children. The left-handed child may 
position the writing hand across the 
midline in a manner and direction 
that increase the complexity of the 
writing motor act with respect to mid- 
line orientation, thereby increasing 
the possibility of left-right type re- 
versals of symbols. In addition to 
the mere mechanies of writing with 
the left hand, there is an emotional 
and personality factor. The litera- 
ture suggests that left handedness it- 
self is a symptom of negativism and 
oppositional behavior at an early age. 
Toilet training produces more aggres- 
sion and resistance in boys (Sears, 
Whiting Nowles, & Sears, 1953). 
Left handedness, stuttering, and re- 
more common in boys, may 
o this period of childhood 
bond of motor 


versals, 
be linked t 
through the common 
resistance resulting from emotional 
dynamies, usually anxiety and hos- 
tility. These early and primitive mo- 
tor patterns may be quite resistant 
to change. À child may resist with 
arms and hands in such a manner 
that he is not aware of the resistance 
and, as а consequence, has neither 


guilt nor control over his reversal 
g. Thus, the fixed 


tendency in writin 1 
motor pattern in question may have 
ell as a motor con- 


an emotional as We". ‹ 
ditioning basis. Anxiety may either 


fixate responses OT produce excessive 
vacillation of such responses with re- 
spect to midline orientation. 

An analysis of the literature lends 
support to the theoretieal position 
underlying this study and reinforces 
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the impression that boys are not in- 
ferior to girls with respect to basic 
skills. Boys are not more immature; 
they are different. Werner (1940, 
1957) postulates that the most primi- 
tive concepts of spatial relationships 
involve dynamic visual movement 
(such as in а tapping test), while 
the more highly refined concepts of 
space are more abstract and imper- 
sonal and involve a minimum of, or 
no, movement. These findings suggest 
a causal link whereby boys may suf- 
fer greater impairment of concepts of 
spatial relationship because of mus- 
cular aggressiveness or hyperactivity 
as an expression of anger. 

Goodenough (1931) found that 
there is a much greater frequency of 
anger and aggression in boys than 
girls after the age of 2 and that this 
sex difference is maintained up to 
the age of 7 and beyond. Thompson 
(1962) remarks that this fairly con- 
sistent sex difference in the frequency 
of anger episodes is interesting and 
enigmatic at our present state of 
knowledge. The writer believes that 
the overt expression of anger in 
boys is one of the major causative 
factors in fixing of rigid motor pat- 
terns to or away from the body mid- 
line. 

Michael, Zimmerman, and Guil- 
ford’s (1951) experimental work pro- 
vides evidence that spatial relations 
continue to be coordinated to the 
human body even into the elementary 
school years. They also found that a 
factor analysis of 15 different tests 
of spatial relations and visualization 
revealed definite superiority in the 
boys. Hildreth (1932), in her excel- 
lent monograph on the subject of re- 
versals, contends that the greater in- 
cidence of reversals in boys is a logical 
consequence of the learning process 
and an indicator of jmmaturity, an 
argument that is not supported by 
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the findings that show boys to be 
superior in motor ability and spatial 
relations. 

The developmental importance of 
the body midline to writing has been 
suggested by Ames (1948). She 
found that children, when writing on 
a sheet of paper, tend to move from 
the lower right-hand corner of the 
paper up to the center and then up 
to the top center, and that these ori- 
entation changes are sequential and 
orderly. 

Speed of reaction and activity lev- 
els in boys may account for rever- 
sals. Koch’s (1934) study, which in- 
volved a factor analysis of activity in 
nursery school children, found that 
speed of reaction is often classified 
as nervousness or aggressiveness by 
observers. This speed may actually 
be inhibited by the classroom teacher 
and produce motor resistance. The 
writer also feels that the superior 
speed of response found in boys 
could produce more spontaneous 
primitive motor responses to ог away 
from the midline than would appear 
when reactions are more deliberate. 
Pomeroy (1938), studying children 
5y5-years-old, found that extroverted 
boys are speedier than introverte 
boys, while girls fell in the middle 
range of the extroversion-introver- 
sion scale. А 

The writer feels that these studies 
generally support the theoretical post- 
tion taken with regards to the sig 
nificance of the body-midline axis 20 
the production of reversals in children 
and points to the need for the inves 
tigation of this phenomena for tho 
purpose of motor retraining as well 8 
theory construction. On the basis о 
this research, primarily a pilot stu i 
it would be profitable to enlarge t 
study to include a larger samp. € = 
brain-damaged subjects as wal s 
mentally retarded children. A 
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sl research is required to deter- 
now с, this technique can 
“a ied the instruction of number 
= er writing in the classroom 

permit the permanent correction 


of rever; M 5 
sals in 
cases. persistent reversal 
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EFFECTS OF DELAY OF INFORMATION FEEDBACK 
AND LENGTH OF POSTFEEDBACK INTERVAL 
ON LINEAR PROGRAMED LEARNING' 


FREDERICK J. BOERSMA? 
Department of Educational Psychology, University of Alberta 


The study attempted to determine the independent effects of delay 
of information feedback (IF), post-IF delay, and sex in a complex 
learning task in which Ss learned a series of symbolic logic rules by 
programed instruction. 56 Ss served individually in a factorial design 
which combined 2 delays of IF (0 and 8 sec.), 2 post-IF intervals 
(0 and 8 sec.), and a sex factor. The data did not support the predic- 
tion that human learning would be facilitated by increasing the post- 
IF interval, nor did it yield a significant delay of IF effect. But it 
did show that a simple explanation in terms of the independent ef- 
fects of delay of IF or post-IF is inadequate. A competing-response 
interpretation of delay was formulated with respect to error scores, 
answer latencies, and the delay variables, 


Though the study of experimental 
delay goes back nearly 50 years, and 
though research has demonstrated the 
retardation effects of delay on lower 
organisms, it has failed to generate 
consistent findings with humans (Ren- 
ner, 1964). Thus, the complete ac- 
ceptance, without qualification, of 
delayed-reinforcement or information- 
feedback (IF) generalizations, which 
Suggest that a few seconds delay be- 
tween response and reinforcement, may 
mean the difference between maximal 
learning and no learning whatever, is 
unwarranted in human learning. 

Bourne and Bunderson (1963) have 
suggested that the controversial de- 
lay effects in human learning may be 
due to the fact that experimenters 
have failed to control for confound- 
ing effects of the intertrial interval 
(ITI) with post-IF delay and have 
failed to consider methodological fac- 
tors as task complexity. To control 


1 This paper is based on a dissertation of 
the кзз submitted to the Graduate 
College of Michigan State University in 
partial fulfillment of the requirements for 

degree. 
t ha pm is indebted to Bernard R. 
Corman for his valuable assistance and ad- 
vice in the conducting of this study. 


for such confounding effects, pn 
menters developed a design whic 
permitted them to vary these vari- 
ables orthogonally while maintaining 
an average ITI between treatments 
They found that delay of IF did éd 
retard concept attainment, but tha 
increases in the post-IF interval le 
to significant decreases in error scores: 
and the post-IF effect became more 
prevalent as concept complexity 10" 
creased. 

Brackbill and Kappy (1962), how- 
ever, have suggested that discrepar 
cies may be attributable to the fac 
that the subjects (Ss) can mediato 
the delay interval in some cases, ÞU 
not in others. Yvonne Brackbill (1964, 
unpublished)? provided some supp?! 
for this hypothesis in a serial learn- 
ing study. She found that an interpo- 
lated task initiated during delay Te 
tarded performance and conclu tip 
that decremental effects associate 
with delay increase when Ss canno 
reinstate symbolically some form 9 


*The manuscript, “The Impairment а 
Learning under Immediate Reinforcement 
can be obtained in dittoed form from di- 
author at the University of Colorado Me 
cal Center. 
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d response aftereffect. Her study 
foundi Ot, however, control for con- 
TF Acie effects of the ITI with post- 
complexity. did it consider task 
t oe appears that variables other 
sidered ay of IF per se must be con- 
regardi when making generalizations 
Draen ng the effects of delayed rein- 
that t ent on human learning, and 
tions 3 pica! reinforcement generaliza- 
simple ге: on lower organisms and 
to UP may not be applicable 
from Ment school learning. Results 
more clo ies employing complex tasks 
ing, as sely related to school learn- 
Which co in programed instruction, 
of dela ntrol for confounding effects 
es y with the ITI are now needed. 
ent Mode purpose of the pres- 
of Bour Y was to extend the work 
using a S and Bunderson (1963) by 
school | er closely resembling actual 
tas ee programed-learning 
Purpose symbolic logic. A secondary 
op и: to examine verbal reports 
o behest questionnaire responses) 
9f the le rom Ss at the conclusion 
that, mach fae” task. It was surmised 
in ме data would provide valuable 
of delay. р concerning the influence 
learnin ^ reinforcement on a complex 
ational ies and shed light on medi- 
n cesse. 
Ee delay c E uu 
ped o differs from traditionally 
In that ic He research on delay 
Materia] e differential complexity of 
Sitateq ei the program neces- 
€Xperim e use of subject- rather than 
ervals K oro feedback in- 
Mental ut this change in experi- 
S Wero ра was imperative if 
блесна оо adeguate LE, Ji was 
error sco that all correlations between 
léspective ty and IF intervals for the 
ble, aud treatments would be negligi- 
that consequently this new 


Tep 


141 


variable would not confound the 
study. The hypothesis was supported. 
. Figure 1 represents the temporal 
intervals common to this study. The 
ITI was defined as that period of time 
between response and the presenta- 
tion of the next stimulus. The delay 
of IF interval referred to the amount 
of time Ss waited after responding be- 
fore receiving IF, while the post-IF 
interval referred to the amount of 
time Ss waited after receiving IF be- 
fore being exposed to the next frame. 
The IF interval was synonomous 
with the answer-latency variable. 

It was predicted that the post-IF 
interval would be the crucial variable 
in this complex learning task, and 
that the effect of delayed IF would be 
negligible. It was further surmised 
that the data would resemble Bourne 
and Bunderson's (1963), since for all 
practical purposes ihe experimenter 
(E) had controlled for confounding 
effects of ITI with the delay vari- 
ables and had used a difficult (com- 


plex) learning task. 


PROCEDURE 

Subjects 
Fifty-six volunteer educational psychol- 
S women, served 


ogy students, 28 men and 2 
individually as Ss in this ex; 
median age Was 20 years, an! 
tive grade-point average, 2.66. i 
randomly assigned to the four experimental 
conditions (14 per experimental treatment) 
with sexes equally represented in each treat- 
ment. 
Task and Apparatus 

p " " x6 die 

The linear program 1n symbolic logic 

veloped by Evans, Glaser, | and Homme 
(1962) was modified for use 1n this experi- 


periment. Their 
d their cumula- 
These Ss were 


DIF IF PIF 


pug ue 


ITI 


Fic. 1 Temporal intervals common to the 


present study. 
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ment. The final form of the modified pro- 
gram consisted of 7 rules to be learned in 
78 frames. Each frame was programed ac- 
cording to the dd System (Evans, 
Homme, & Glaser, 1962). 

The "apparatus consisted of an MTA 
Scholar teaching machine, a Hunter timer, 
and an Esterline-Angus recorder. The teach- 
ing machine and timer were wired so that 
E could control the delay intervals. The 
Esterline-Angus recorded latency to response 
and amount of time Ss spent examining the 
correct answer. The program was advanced 
by the internal mechanism of the apparatus 
when Ss pressed a button. 

Ss were allowed to control the amount of 
time spent on each frame. After Ss con- 
structed a response, they were informed of 
the correct answer either immediately or 
after an 8-second delay. The amount of time 
spent examining the correct answer was also 
controlled by Ss. After this second self-de- 
termined interval, they were exposed, either 
immediately or after an 8-second delay, to 
the next frame. In all treatments, the stim- 
ulus material and S's constructed response 
were withdrawn from view during the delay 
intervals. The lengths of the respective de- 
lays, that is, the delays of IF and post-IF, 
were predetermined as a function of the 

treatment conditions. 

Each S received a 5-minute break midway 
through the program and before adminis- 
tration of a criterion test after completion 
of the program. This test consisted of 10 
true-false questions, 10 recall questions, and 
8 deductive-proof questions; an open-ended 
questionnaire was also included in the bat- 
tery. Average total-program running time 
per S was approximately 2 hours, and varied 
from 1 hour, 25 minutes to 2 hours, 35 min- 
utes. 

The dependent variables were two error 
scores, а response-latency score, and an an- 
swer-latency score. Number of errors indi- 
cated to what degree Ss responded correctly 
to program and criterion-test items. Re- 
sponse latency referred to the average 
amount of time spent by Ss responding to 
programed items, and answer latency to the 
average amount of time spent by Ss ex- 
amining the corect answers to these items. 

Scorer reliability (test-retest) over a 7-day 
period was .963 for program errors and 
1981 for criterion-test error scores. The re- 
liability of the criterion test was .843 (KR 
21), whereas the correlation between pro- 
gram and criterion error scores was 875. Re- 
corder reliability (interjudge) was .994 for 
response latencies and .998 for answer la- 
tencies. 
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Design 


À 2 x 2 x 2 factorial design was 
used with 2 delays of IF (0 and 8 
seconds), 2 lengths of post-IF inter- 
val (0 and 8 seconds), and a sex fac- 
tor. The experimental design per- 
mitted E to vary the effects of delay 
of IF and post-IF orthogonally, to 
maintain an average intertrial inter- 
val of 8 seconds between these 2 fac- 
tors, and to control for possible 
massing effects between the various 
treatment conditions. 

The 4 treatment conditions were 
identified as follows: 0-0 indicated 
complete self-pacing by Ss (i.e. no 
delay of IF or post-IF delay); 0-8 
indicated no delay of IF, but an 8- 
second post-IF interval; 8-0 indicated 
an 8-second delay of IF, but no post- 
IF delay ; and 8-8 signified an 8-second 
delay interval before and after ex- 
posure to correct answer (ї.е., before 
and after IF). 


RESULTS AND DiSCUSSION 


The analysis of variance on program 
error scores revealed that all тат 
and interaction effects were insignifi- 
cant except the delay of IF x post- 
IF interaction (F = 5.640, df = 1/48 
p < .05). Mean number of errors for 
Treatment 0-0 was 7.62; for Treat- 
ment 0-8, 11.62; for Treatment 8-0, 
13.30; and for Treatment 8-8, 9-52- 
Similar, though not as pronounced; 
findings were obtained for the analy- 
sis-of-criterion error scores. In ue 
analysis, the delay of IF x post-l 
interaction failed to reach the .05 leve 
of significance; however, it did excee 
the .10 tabled value of the F distri 
bution (F = 2.821, df = 1/48) 
Figure 2 represents graphically the de- 
lay of IF x post-IF interactions 10" 
program and criterion error scores. di 

A significant sex effect was fou? 
for the analysis of average respons? 
latencies indicating that females spe” 
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1 i " 
cape responding to each frame 
4.085 y^ than did males (F — 
rad Eat de 05). All other 
cmd ped interaction effects for re- 
ial atencies were insignificant. A 
Mises analysis was performed on 
оа program — answer-latency 
did th his breakdown revealed, as 
шей previous analysis, that female 
than s were on the average shorter 
EID E e (F = 4908, 
deni 48, p < .05). And, more im- 
signifies it revealed that Ss spent a 
cir ee greater amount of time 
treatments correct answers when their 
layed, r 5 were associated with de- 
back pie than immediate, feed- 

1on = эө 
ТЕ ш (Р = 12.676, df = 
Satie on the open-ended 
egorize Paus were independently cat- 
havioral by two judges into be- 
fatigue classifications indicative of 
(competi (physical) and frustration 
ive mete responses). Representa- 
are: анна of fatigue responses 
wards th seemed to get tedious to- 
couraged. end”; “I felt somewhat dis- 
tiredne and became aware of my 
kept "m towards the end"; and "I 
veri ‘When will it ever be 

Examples of frustration 


i EFE PROGRAM ERRORS 
4 --- CRITERION ERRORS 


PLA 


$ T7-----2 
ае 
oor" 


MEAN NUMBER OF ERRORS 
о 


о- 
DIF 8-DIF 


Барша representation of the 
it post-IF interaction for pro- 
erion-test error scores. 


Fia, 2 
as of IF 
am and er 
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TABLE 1 
PERCENTAGE OF FREQUENCY OF REPORTED 
FATIGUE AND FRUSTRATION RESPONSES 
oN OPEN-ENDED QUESTIONNAIRE 
as л FUNCTION оғ TREATMENT 


Type of response 


Treatment 
DIF Post-IF Fatigue Frustration 
0 0 43^ 0 
0 8 0 57 
8 0 7 57 
8 8 7 57 
a N = 14 per cell. 


“J wished the machine 


responses are: 
t was frustrating 


would hurry up—i 
when you had to wait”; “It was some- 
what confusing as the rules multi- 
plied”; and “I found myself wishing I 
had a reference to all rules in relation 
to each other at one time for study.” 
95% agreement between 


There was 
the independent classifications of the 
two judges. ‘able 1 gives percentage 


i open-ended question- 


breakdowns 0 
a function of treat- 


naire responses as 


ment. Р 
The present study fails to support 


Bourne and Bunderson's (1963) sup- 
position that the post-IF interval is à 
critical variable in human learning. 
Where Bourne and Bunderson found 
that increases in t-IF interval 
were accompanie 
error scores, the e gt 
delay in this experiment were negligi- 
ble. But the position of Bourne and 
Bunderson that delay of IF is not à 
critical variable in complex human 
supported. 
Both in Bourne and Bunderson’s 

tudy, delay of IF 
re varied orthog- 
1 difference be- 
se two studies Was that 
d Bunderson’s tasks were 
a constant level of diffi- 
h that IF could be presented 
{ a second (a light 


peaked at 


culty such ! 
jn à fraction O 
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flashed on the apparatus), and, as a 
result, E completely controlled the 
ITI. In the present study task, how- 
ever, complexity varied throughout 
the program and necessitated the use 
of subject- rather than experimenter- 
controlled IF, and consequently com- 
plete E control of the ITI was im- 
possible. 

Thus, Bourne and  Bunderson's 
treatments are relatively more massed 
than are those used in the present 
study. The effects of massing should 
be most prevalent in their self-paced 
treatment, 0-1, and consequently this 
treatment should be associated with 
maximum error scores. Massing effects 
should not occur, however, under self- 
pacing, such as that used in the pres- 
ent study, where S controls the length 
of the IF interval. Moreover, the rank 
order between comparable-delay 
treatment means in Bourne and Bun- 
derson's and the present study should 
be similar since massing should dis- 
sipate when delay of IF and/or post- 
IF delay is introduced into the ex- 
perimental situation. 

This interpretation was supported 
by a comparison of comparable treat- 
ments between the two studies.t Where 
Bourne and Bunderson found their 
0-1 treatment to be associated with 
the largest number of errors, in the 
present study the comparable 0-0 
treatment produced the fewest num- 
ber of errors. An identical rank order 
was found between all treatment 
means where delay intervals were 
similar. 

Even though massing per se did not 


* Bourne and Bunderson’s five-dimensional 
concept treatments were used for this com- 
parison. These treatments were chosen since 
the complexity of these tasks more closely 
resembled that of the tasks used in the pres- 
ent study, and since Bourne and Bunderson 
found the post-IF effect to be more pro- 
nounced for complex concepts. 


appear to be a crucial factor affecting 
performance in the present study, 
4396 of Ss in the self-paced treatment 
did complain of physical fatigue due 
to length of program, whereas 796 or 
less made similar complaints in each 
of the delay treatments. If fatigue due 
to massing is operating during com- 
plete self-pacing, as it appears to be, 
its influence should be less than that 
associated with self-pacing in which 
E provides IF in a fraction of a sec- 
ond. 

The answer-latency data indicated 
that a significantly greater amount 
of time was spent by Ss examining 
correct answers when treatments were 
associated with delayed- rather than 
immediate-feedback conditions. It ар- 
pears that when IF was delayed, the 
number of interfering (competing) 1°- 
sponses which S had to process and/0! 
eliminate while examining the correct 
answer was increased, and conse 
quently it was necessary for him to 
spend more time examining correct an- 
swers under delayed- than immediate- 
feedback conditions. Support for this 
interference interpretation was ob- 
tained from the analysis of question- 
naire responses which revealed that 
57% of Ss in each of the delay treat- 
ment complained of frustration 
(competing responses) while working 
through the program. 

In sum, it is surmised that: (9) 
Competing responses are initiated dur- 
ing delay; (b) these responses 2e 
overtly characterized in terms of frus- 
trative behavior; (c) these response? 
become conditioned to the same stimu? 
as does the to-be-learned response, 27 
consequently acquisition time 20" 
creases as a function of increments m 
the delay variable; and (d) Ss 87e 
able to use the IF interval to proces? 
and/or eliminate many of the com 
peting responses elicited during delay- 


DELAY or INFORMATION FEEDBACK 


ores it appears that if delay of 
ies been extended to the point 
m. e Ss could no longer cope with 
UT dompedisg responses, a signifi- 
Ead пау, pi IF effect would have 
"usen ams Similar competing-response 
Pee ds ran of delay have been 

pee by Spence (1960) and 
à ide Adelman (1955) for ani- 
iar а апа by Denny? for hu- 
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EIGHT PROFESSORS—A STUDY ON COLLEGE 
STUDENTS’ PREFERENCES AMONG THEIR 
TEACHERS 
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University of Iowa 


Students’ preferences for different 


AND 


HENRY F. DIZNEY 
Ezamination Aids Center, 
Kent State University 


types of college professors, Adminis- 


trator, Socialite, Teacher, and Researcher, from each of 2 sources, edu- 
cation and arts and sciences, were studied. 87 graduate students, 131 
seniors, and 82 juniors in secondary education responded to a Likert- 
type inventory, the Eight College Professors. Type and Source of 
professors were significant sources of variation. Teacher, Researcher, 


Socialite, and Administrator were 


preferred in that order, and educa- 


tion was preferred to arts and sciences as a source. Professor Type X 


student Level interaction was also 


significant. Researcher was the only 


type to gain in preference from juniors to graduates. No remarkable 


sex differences were observed. 


McGee (1962) has discussed the 
“disorganized” nature of the Ameri- 
can university and attributes that 
condition to its threefold (creation- 
conservation - dissemination) func- 
tion. Others agree in principle if not 
in terminology (e.g, Knapp, 1962; 
Pusey, 1963). Shifting importance at- 
tached to these functions by various 
elements of the university are sug- 
gested by both Knapp (1962) and 
Caplow and MeGee (1961). Some of 
the concomitants of such change 
appear disturbing. Jacob (1957) sug- 
gested that the entire function of the 
teaching profession has fallen into 
low esteem among students. A more 
sober picture was drawn by Kerr 


(1963) : 


The undergraduate students are restless. 
Recent changes in the American university 
have done them little good.... There is an 
incipient revolt of undergraduate students 
against the faculty; the revolt that used to 
be against the faculty in loco parentis is now 
against the faculty in absentia [p. 103]. 


Faculty members may be in absen- 
tia in ways other than physical. Since 
academic advancement often seems 
more related to nonteaching than 


teaching activity, it may be expected 
that the press of other demands сап 
adversely affect the quality of 10- 
struction. On the basis of observation 
it seems that four rather distino 
"types" of faculty personnel exist. 
For this study they were named Ad- 
ministrator, Socialite, Teacher, aP 
Researcher. Three of the four types 
tend to be in absentia because of the 
very nature of their interests. 
Evidence indicates that the per- 
ceptions of students regarding teacher? 
are more a funetion of internal frame? 
of reference than of concrete teacher 
characteristics (Gage, 1953; Katt, 
1962). Knapp’s (1962) summaries 
point out that students tend to hav? 
sharply defined, consistent images a 
professors and emphasize persona- 
social rather than intellectual char; 
acteristics in ratings of “effective 
teachers. i. 
It may be expected that: (a) on 
dents, given a choice, would pre! 
as their professors Teachers 97 
Socialites to Researchers and 
ministrators; (b) female stu dii- 
would reveal a pattern of larger tu- 
ferential preference than male 8 
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ч, (c) advanced students would 
mit ы stronger preference for the 
үре groups to the first two than 
б. m students; and (d) pro- 
roc ош liberal arts would be 
ra ed to those from education. 
ion predictions are based either 
FA ge findings or common ob- 
rs E^ reflecting prestige hierar- 
2 à; hus, this study was an ex- 
ee mn of student preference for 
of Ж. in relationship to (a) types 
tien m (b) students’ sex, (c) 
collegiate pmo зат level, and (d) 
eee, entification (source) of 


METHOD 


Subjects 


Ss 
gee eee study were 300 secondary edu- 
these IM at Kent State University. 
Seniors, were graduate students, 131 were 
5, and 82 were juniors. 


Procedures 


8; А 
the Sete administered in group sessions 
Bight coli instrument. 
item, 5. queis professors. This was an 8- 
structed bo ir Likert-type inventory Con- 
from Phil y the authors, taking their lead 
a Tube (1963, 1964) and Tannenbaum 
trator Son нен of professors (Adminis- 
WO colle cialite, Teacher, Researcher) from 
i sources (education and liberal 
graphs, Ss escribed in each of eight para- 
Duum fı rated each description on a conti- 
through rom “strongly prefer” (5 points) 
а) m neutral" (3) to "strongly reject" 
Fej ei descriptions follow. 
Sistant - M is an administrative as- 
has alwa: e Director of Admissions. He 
ministratio been interested in college ad- 
e deos and aspires to higher office. 
e Coll an academic appointment with 
teaches n of Arts and Sciences ап 
Course in ond has taught for 10 years) а 
8 loetorats ucational methodology. He has 
18 married ; from a well-known university, 
his сап and has three children. Besides 
Tegistrar, doo contacts, he works with the 
versity gdmintt treasurer, and other uni- 
Tators P NT as well as adminis- 
or Prof other colleges and universities. 
essor E, change the College of 


Arts and Sciences to the College of Edu- 
cation.] 

Professor B is advisor to the largest 
social fraternity on campus. He has been 
an active member of a lodge group for 
a number of years and aspires to gain 
greater social prestige. He holds an aca- 
demie appointment with the College of 
Education and teaches (and has taught for 
10 years) a course in educational method- 
ology. He has a doctorate from a well- 
known university, is married and has three 
children. Besides his classroom. contacts, 
he socializes with faculty, administrators, 
and non-university people on a wide basis. 
He is a frequent attendant at plays, con- 
certs, parties, and conventions. [For Pro- 
fessor F, change the College of Education 
to the College of Arts and Sciences.] 


Professor C is an academic advisor to 
students. He takes seriously his role as 
student mentor and devotes much time to 
the preparation of his lectures. He enjoys 
teaching and aspires to become a better 
teacher. He holds an academic appoint- 
ment with the College of Arts and Sci- 
ences and teaches (and has taught for 10 
years) a course in educational methodol- 
ogy. He has а doctorate from a well- 
known university, is married and has three 
children. Besides his classroom contacts, 
he works with students and colleagues 
toward more effective instruction. He 
spends & great deal of time helping stu- 
dents in their academie work. [For Pro- 
fessor G, change the College of Arts and 
Sciences to the College of Education.] 

a large-scale Te- 


cts e 
een inter- 


has always b 


ested in basic research an 

and his research activity. n so bs 
i ointment wit e 

academic арро! A angi 


nd teaches (an 
ourse in 
a doctora 
is marrie 


he associates } 


searchers. His research results in the pres- 


ion of papers at professional conven- 
Erg ike publication of articles in 
specialized journals. [For Professor H, 
change the College of Education to the 


see of Arts and Sciences.] | 
n бан rere assembled in ran- 


ptions W ) r 
dom order for each S on 8 44% X 8-inch 
cards. The instrument was then presented 
with t 


hese instructions: f 
As you may know, it is & frequent 
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practice at many colleges to employ in- 
structors for educational methods courses 
from either Arts and Sciences or Educa- 
tion faculties. It is also well-known that 
professors, regardless of their specialty, 
vary widely in terms of personal charac- 
teristics. The packet describes eight gen- 
eral types of professors. Read and rate 
these professors one at a time in the order 
they appear in the packet, i.e., first ap- 
pearing professor should be rated before 
reading the second description, etc. For 
example, your first professor may be any 
between A and H, but assume that it is 

D. Read D first and be careful to rate 

him in D's position on the answer sheet. 

Are there any questions? 

When scores on each type were correlated 
between sources for Ss, the following coeffi- 
cients were obtained: Administrator, .86; 
Socialite, .83; Teacher, .65; and Researcher, 
82. These were regarded as indexes of in- 
ternal consistency and deemed sufficient for 
the present study. 


RESULTS 


Mean scores obtained by sub- 
groups on the Eight College Pro- 
fessors are summarily shown in Table 
1. The results of an overall (sexes 
combined) analysis of variance by 
Lindquist’s (1953, p. 296) Type VI 
indicated significant professor Type 
main effects (р < 001, F = 343.35, 
ај = 3/891, MSeror = 1.18). Order 
of preference, as seen in Table 1, was 
Teacher, Researcher, Socialite, and 
Administrator. Further, the professor 
Type interacted significantly with 
student Level (p < 05, F = 2.54, 
df = 6/891, MSeror = 1.18). Student 
Level main effects approached the 
05 level with greater acceptance 
shown successively by seniors, gradu- 
ates, and juniors. In addition, pro- 
fessor Source main effects were sig- 
nificant (p < .05, F = 4.59, df = 
1/297, MSerror = -18) with education 
professors receiving higher ratings. 


1 Further analysis by  Ebel's method 
(Guilford, 1954, p. 395) indicated the mean 
reliability for a single S (rater) of 496, 
while the reliability for the mean of the 
eight professor ratings from the 300 Ss 


was .997. 
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Separate analyses for males and 
females revealed essentially the same 
patterns, and no major sex differences 
among these Ss were evident in rela- 
tionship to their faculty preferences. 
For both sexes the most noticeable 
difference from the overall results 
was a nonsignificant professor Source 
effect. 

When the two professor Sources 
were combined and Ss’ ratings оп 
the four professor Types correlated 
among themselves, coefficients be- 
tween the Administrator and the 
Socialite, Teacher, and Researcher 
were 25 (p < .001), —.02, and .14 
(p « .05), respectively. Those be- 
tween the Socialite and the Teacher 
and Researcher were .05 and —.04- 
Finally, the correlation betwee? 
Teacher and Researcher was —-19 
(р < .01). Since each S rated each 
type, these low correlations are inter- 
esting. The direction and size of the 
coefficients suggest that the image ? 
"teacher" holds in students’ minds 
stands more or less alone apart from 
any of the other types. 


Discussion 


In the confines of the present ех" 
ploratory study, it appeared that 
students indeed reveal differential 
preferences among different profes- 
sor types. In all three groups (male, 
female, and both), the order of pref- 


erence was ‘Teacher, Researcher: 
Socialite, and Administrator. The 
authors’ first prediction was i 


partially supported. The second Pr 
diction of a larger differential pref- 
erence among female than male stu- 
dents was not upheld, since both sexe? 
revealed essentially the same Pre 
erence patterns. 

In all three comparisons there were 
significant (p < .05) professor TYP? 
X student Level interaction е e 
The top rating of Teacher held, bU 
decreased from juniors to graduates: 


TABLE 1 


Mean Scores on THE Eran COLLEGE PROFESSORS BY PRorEssor Type, PROFESSOR Source, SUBJECT LEVEL, AND SUBJECT SEX 
Professor Ss 
Graduate Senior Junior All levels 
Type Source 
M F M+F M F M+F M F M+F M F M+F 
(N = 64) | (N = 23) | (N 2 8) | (N = 74) | (N = 57) (У = 131) | (N = 37) | (N = 45) | (N = 82) |( = 175)|(N = 125) (У = 300) 
Administrator A&S 2.89 2.70 2.84 2.59 2.88 2.72 3.16 2.87 3.00 2.82 2.84 2.83 
Educ. 2.89 2.65 2.83 2.59 2.89 2.73 3.22 2.84 3.01 2.83 2.83 2.83 
Both 2.89 2.67 2.83 2.59 2.89 2.72 3.19 2.86 3.01 2.83 2.84 2.83 
Socialite A&S 3.05 3.13 3.07 3.12 3.04 3.08 3.35 3.00 3.16 3.14 3.04 3.10 
Educ. 3.05 3.26 3.10 3.14 3.11 3.12 3.24 3.13 3.18 3.13 3.14 3.13 
Both 3.05 3.20 3.09 3.13 3.07 3.10 3.30 3.07 3.17 3.13 3.09 3.12 
Teacher A&S 4.58 4.57 4.57 4.64 4.49 4.57 4.76 4.78 4.77 4.64 4.61 4.03 
Educ. 4.70 4.48 4.64 4.77 4.56 4.08 4.76 4.80 4.78 4.74 4.03 4.70 
Both 4.64 4.52 4.61 4.70 4.53 4.63 4.76 79 4.77 4.70 4.02 4.66 
Researcher A&S 3.30 3.83 3.44 3.04 3.12 3.08 3.05 3.04 3.05 3.14 3.22 3.17 
Educ. 3.28 3.78 3.41 3.16 3.16 3.16 3.03 3.18 3.11 3.18 3.28 3.22 
Both 3.29 3.80 3.43 3.10 3.14 3.12 3.04 3.11 3.08 3.16 3.25 3.20 
All types A&S 3.45 3.55 3.48 3.35 3.38 3.36 3.58 3.42 3.49 3.44 3.43 3.43 
Educ. 3.48 3.54 3.50 3.42 3.43 3.42 3.56 3.49 3.52 3.47 3.47 3.47 
Both 3.47 3.55 3.49 3.38 3.41 3.39 3.57 3.46 3.51 3.45 3.45 3.45 
Note.—Scores assigned: strongly reject, 1; reject, 2; neutral, 3; prefer, 4; and strongly prefer, 5. 
^ A & 8 denotes the College of Arts and Sciences. 
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Socialite followed suit, but Researcher 
gained as Level increased. Adminis- 
trator decreased from junior _to 
senior, but regained his standing 
with graduates. The authors’ third 
prediction tended to be supported 
by this evidence. A longitudinal study 
to examine the suggested sequence 
is indicated. 

The fourth and final prediction of 
preference for liberal arts over edu- 
cation professors did not receive sup- 
port. In fact, the opposite preference 
was shown by these education stu- 
dents. 

It was further observed that the 
ratings were not highly correlated 
among themselves in spite of their 
“repeated measurement” feature. A 
focused nature of student perception 
is suggested here in agreement with 
Knapp’s (1962) observation discussed 
earlier. 

A limitation of this study is that 
the results could be a direct function 
of the specific professor descriptions 
offered. Nevertheless, the results sug- 
gest that, even though the traditional 
teaching function may have been 
relegated to secondary status, stu- 
dents still prefer to have a mentor as 
their professor. Administrators may 
impress them in various other con- 
texts but, so far as classroom instrue- 
tion is concerned, students prefer a 
full-time Teacher. A Researcher in- 
deed gains in stature as students 
climb the educational ladder, but 
again they would rather study with 
a Teacher. 

The simple instrument, Eight Col- 
lege Professors, seems to be adapt- 
able to many variations in the study 
of interpersonal perception and pref- 
erence among educational personnel. 
Professors’ specialty and background 
experiences may be systematically 
varied in the descriptions, students’ 
various levels and majors may also 
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be extensively explored, and differ- 
ent schools in different areas may be 
compared on their students’ differen- 
tial preferences. The instrument 1$ 
also applicable to groups of faculty 
members themselves. Some of these 
possibilities are now being investi- 
gated by the authors and will be re- 
ported later. Whether the expressed 
reactions correspond with these stu- 
dents’ behavioral reactions also merits 
a serious future inquiry. 
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EFFECTS OF W 
? WRITTEN VERBALIZATION A 
/ ND TIN 
OF INFORMATION ON PROBLEM SOLVING ees 
PROGRAMED LEARNING’ 


Se Б 08 J. SEIDEL ax» IRIS C. ROTBERG 
man Resources Research Office, George Washington University 


Traini 
ning on computer-program (CP) writing was given to 60 high 


scho i ; 

(a) ao ina 3 X 2 factorial design concerned with effects of 

miting E explicitly the rules used in constructing the CPs, (b) 

(c) um ue of these rules in conjunction with writing CPs, or 

tion R 5 only the CPs. The other factor was prompting vs. confirma- 
. Results indicated that: (2) during learning, prompling was signifi- 


cantly superior to confirmati 


criterion tests; (b) i i iti 

ter ests; naming the rules in addition to 

анай aided later performance wl 
he eriterion tests; (c) writing ru 


on, but a reverse tendency appeared in the 


io writing CPs during 
hen writing more complex CPs 
les during training actually hin- 


de i Ca 
red Ss in writing CPs later on the criterion tests. 


MEN LLL 


M 
sione d the research concerning the 
&ramed i learning factors in pro- 
вавазана MER (PI) has been 
Serial lear With paired associates OT 
Where ex ning. Little has been done 
een йн an eae treatments have 
Organized ipulated for hierarchically 
his type pai discourse? Yet 
Medium fe discourse is precisely the 
gramed j or most, if not all, pro- 
na 2 structional course work. 
by с ше of serial learning studies 
& Spitze and associates (e.g, Cook 
daine (195 1960), Angell and Lums- 
aS been р, and others as well, it 
Significant ound that prompting is 
ly more effective than 


Performed by i reported in this study was 
watch Office the Human Resources Re- 
ony ids of George Washington Uni- 
the Army, hee to the Department 
49 necessarily he contents of this paper do 
th the Bar represent the official opinion 
р. are ind, ient of the Army. The au- 
us elpfu] aa ted to Eugene A. Cogan for 
-on Of this uggestions during the prepara- 
Es Of this manuscript. A preliminary ver- 
À chological ^4 was read at the American 
p Бег 1964 ssociation, Los Angeles, 

есе; x 
4) ij Study by Hickey and Newton 
бе ы attempt. Also, for an 
ion of problems in doing 


on i 
А conceptual material, see 
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or learning. One study 
by Gagné and Brown (1961) dealing 
with conceptual material compared 
several combinations of prompting 
and confirmation (although they did 
not use these terms) in the learning 
of principles for generating number 
In their study, the guided- 
firmation) group, me- 

condition, appeared 
e, while the rule- 


pting) condition, 
was least effec- 


confirmation f 


series. 
discovery (con 
dium-diffieulty 
to be most effectiv 
and-example (prom 
low-difficulty level, 


tive. 

Thus, Angell and Lumsdaine's 
(1961) results suggest that the sub- 
jects (Ss) should be given substantial 
information before being required to 
respond, while Gagné and Brown's 
(1961) results suggest that Ss should 
be required to discover the answers 
themselves. ise factors that 
caused the 
cannot be determined because of the 


differences 


etween conditions 


within an experiment. For example, 
the contiguity of correct stimulus- 


response association Was not held 
constant across prompting and con- 
firmation conditions, nor were cor- 
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rect responses necessarily given the 
same number of times nor in the same 
manner. А А 

For а meaningful comparison of 
prompting and confirmation, the sole 
element manipulated should be the 
amount of information given to S 
prior to requiring him to respond; 
whatever is not given prior to response 
should be given after response to 
make total information given com- 
parable for the confirmation and 
prompting conditions. A second fac- 
tor of interest in training for concep- 
tual problem-solving proficiency is 
the effect of the act of verbalizing, 
during the training conditions, the 
rules that are being used in the prob- 
lem solution. 

Paired-associate experiments gen- 
erally require S to make a series of 
rote responses, and his learning is 
measured by the ability to give back 
these rote responses. Conceptual learn- 
ing, however, requires the abstrac- 
tion of mediating responses; the suc- 
cess of learning is measured by the 
ability to apply these mediating skills 
to a variety of different situations. 

Many current PI programs are 
parallel to paired-associate learning 
situations in that they seem to em- 
phasize reproducing material pre- 
sented as written constructed re- 
sponses. Yet the program developers 
seem to be interested in teaching 
concepts (mediating skills). The ques- 
tion that arises then is: Does ver- 
balizing the content of a rule, where 
separably measurable from mediat- 
ing responses, aid or hinder a con- 
cept-learning and problem-solving ca- 
pability? Recently, a study by Gagné 
and Smith (1962) suggested that the 
verbalization of principles takes on 
importance as the complexity of the 
problem increases. —! 

The present study, then, examine 
the influence of both of these factors 
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—prompting versus confirmation, and 
alternate ways of verbalizing rules— 
on learning in a conceptual problem- 
solving context. 


METHOD 


Six independent groups of Ss in a PI en- 
vironment learned to write increasingly com- 
plex computer programs (CPs). To explore 
the verbalization factor: (a) One-third of 
the Ss periodically wrote out the content 0: 
the rules used to guide the writing of. CPs 
(Rules Group); (b) one-third periodically 
wrote down the names of the rules (Nam- 
ing Group); and (c) one-third wrote CPs 
without any verbalization of the rules (CP- 
Only Group). In addition during learning: 
(a) One-half of the Ss were required to 
write their answers after being given the ex- 
plicit information needed in the response 
(prompting)*; and (b) one-half were "E 
quired to write their answers prior to 5A 
ceiving the information (confirmation). A 
ror rates were obtained during learning 0 
CP writing for all groups and on rules a” 
naming for the appropriate groups. Р 

Treatments Mem aan ered during tho 
PI training, and their effects were nol К 
criterion tests given at the end of each 
two phases of the instruction. 


Subjects 


Ss consisted of 60 paid volunteer high 
School students! who had been он 
that they were to be paid between $1.50 2 aa 
$2.50 an hour, depending upon the Les 
ficiency they exhibited in learning. Ss Wor" 
assigned at random to one of the six СОЛ! A 
tions (V = 10) in a 2 x 3 factorial a 
—Prompting versus Confirmation аз © 


“In general, as each new concept m 
introduced S was given a body of iet 
tion covering its functions. Obviously, or 
some degree of global prompting existe nse 
all treatments, so that in the strict Seins 
any prompting-confirmation compare, ie 
are relative. However, unless the i ne 
menter waits for his human Ss to div 
the proper "verbal bar press" tror tod 
enormous response repertoire, all such § ilar 
les must of necessity involve a Sim 
event. 

* The authors are grateful to the p 
and staff of Wakefield High School, 5 
ton County, Virginia, for providing 
this study. 


rincipal 
Arling- 
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fact 

usn. LUE the three verbalization condi- 
and cles н) other. A combination of verbal 
EU Ew elements of the Army Classifi- 
y intelli CH was used to classify Ss 
sso m ien level for the detailed evalua- 


Materials 


тее Material constituted a subcourse of 
United с; Specialist Course at the 
отада E Army Signal School (Fort 
Bramin › NJ.) and covered primary pro- 
Б commands of a business-type digi- 


tests; 5 
; booklets used by the CP-Only Group, 
had halt d tule-verbalization requirement, 
uction to аў number of pages. An intro- 
Programin Symbolic language for computer 
d 8 included in the course. The 
computer ^ derived from a task analysis of 
ierarchia] атп, with the material 
bley, Y organized from simple to com- 


The ; 
creasing amin task was writing CPs of in- 
quirement Penity, with the additional re- 
treatments peed by the experimental 
Cach new e he treatments were applied as 
the attendant ne was introduced along with 
s, 48 Rake requirement of writing 2 CP. 
th меге m de possible, the goals of all 
hem with de comparable by orienting 
Objectives "rLi-Page introduction on course 
dic questi his orientation included peri- 
Until he wW: lons which 5 answered covertly 
9f them F Satisfied with his understanding 

e exit Cronbach, 1965). 

em tests for Phases I and II 
É а БАНГ computer-programing 
Fort Hua p by a programers' group 
i g instr chuca (Ariz.) and by pro- 
| Outh Šignal uctors from the Fort Mon- 
28 Were p School. Where necessary, prob- 
QC woul Simplified by removing aspects 
оре of th Tequire techniques beyond the 
че Criterion Course. As a secondary part of 
ЖЫШ the р 1.1685, the ability of all Ss to 

Uch the utes and the names of the rules 
Measured, used in writing the CPs was also 
of 16 8a 3 

f the F Tes rule given below is illustrative 

e CI And Naming treatments: 

any eres SOCSEC instruction clears 

and copies. already in the accumulator 

The OCSEC ot number from location 
th Ortio in 

1е Rul cu underlined above was given to 

Чр, and these Ss were to fill in 


Stamin, 
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the rest, whereas the nonunderlined portion 
was given to Ss in the Naming Group, and 
they filled in the part underlined. Since 8 
worked on only one page at a time, the 
Prompting-Confirmation variable can also 
be illustrated. For the Prompting Group the 
entire sentence was presented first; S turned 
the page and completed the blank. The 
opposite held for ihe Confirmation Group. 


Procedure 

The course was administered in two 
phases. On Day 1, students were given 
Phase I, the section concerned with primary 
programing commands and a criterion test 
for that phase. On Day 2, they were first 
given a retention test consisting of four 
problems to be programed; this was not a 
timed test, but all Ss were through within 
30 minutes. Immediately following retention 
testing, they were given Phase II of the 
course, which covered more complex CPs, 
and the criterion test for that phase. On 
Day 3, Retention Test 2, four programing 
covering all the course material, 


problems 
was given, followed by the intelligence meas- 
ure. 

a maximum of 


The students were allowed 
4 hours on each of the 2 days of training, 


but only one or two students required that 
much time on cither day. Booklets were 
checked for each student at the end of his 
ng day to ensure that he was actually 
working with the material and not simply 
staying for his pay. ‘All students appeared 
highly motivated to perform; many stayed 
after hours to ask further questions about 

(No substantive questions 


the course work. s e qu 
were answered, to avoid contaminating ex- 


perimental treatments.) 


RESULTS 


worki 


Evidence for Learning 

On the test for Phase I, nearly 
two-thirds of the Ss achieved 80% 
correct or more on the program- 


writing material, and more than a 
third of the Ss scored 90% or more. 
Results for Test 2 were comparable. 
These scores on the criterion tests 
for the principal dependent variable 

aming—and scores 


— computer prog" í е 
on the retention tests are given in 
Table 1. Also included are results of 


Bartlett tests on the variances. 
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TABLE 1 
CRITERION AND RETENTION Test MEANS AND VARIANCES BY TREATMENTS 
Training group SD 
Test Summary A = 
ed Rules | Naming) СР mean correct | Rules | Naming | CP-Only 
+ СР | + СР | -Only group | group | group 
ee 
Criterion Test 1° = 
Prompting 39 43 47 43 83 14.84 | 8.66 | 7.99 
Confirmation 43 49 44 45 87 10.95 | 3.01 | 7.55 
Summary mean 41 46 46 
% correct 79 88 88 
Criterion Test 2* " 
Prompting 66 66 75 69 78 28.76 | 15.75 | 10.63 
Confirmation 69 79 77 75 84 19.44 | 7.62 | 11.96 
Summary mean 68 73 76 
% correct 76 82 85 
Retention Test 1 (%)| 76 87 84 82 ns 
Retention Test 2 (%)| 70 80 68 78 ns 


* Maximum possible score: 52. 
b x2(5) = 15.52, р < .01. 
* Maximum possible score: 89. 
d x?(5) = 18.16, p < .01. 


In all cases, intelligence, as indi- 
cated by scores on the Army Classifi- 
cation Battery, was significantly and 
positively correlated with the criterion 
scores. No interactions appeared be- 
tween intelligence level and the experi- 
mental treatments on the principal 
measure, CP writing. 


Treatment Effects 


Learning time. For both Phases I 
and II, prompting and confirmation 
treatments did not differ significantly 


TABLE 2 
LrARNING-TIME MEANS BY 
VERBALIZATION TREATMENTS 


Means (in min.) 


Rust Малы CP-Only 


115.1 
101.6 


116.3 
92.6 


134.5 


se Т 
A 107.0 


Phase II^ 


a F(2.54) = 5.391, p < .01. 
b F = 3.798, p < 05. 


in time to complete the course: m 
verbalization groups did differ sign} 
cantly on Phase I (F — 5.391) and on 
Phase II (Р = 3.798). Table 2 gv? 
the mean time scores for the verbali- 
zation-treatment conditions. TP 
treatment variances did not differ 818" 
nificantly. : 
Significance of the F ratios ie 
Phase I for the verbalization varial 
was due principally to the fact that th 
Rules Group took longer to comp ° 5 
the material than did the other o 
verbalization groups. In Phase II ihe 
Naming Group was faster than 
other two groups. А 
Since the CP-Only Group did um 
have the rules or naming items pis 
their programed instructional peur 
rials, they had about half as m 
booklet pages as the other two ra 
Despite this, the CP-Only Gr in£ 
did not finish faster than the Nam f 
Group in either Phase I or phase 
Because of this fact and 
achievement generally was Е 


s iD 
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TABLE 3 


Timp-W- 
-WEIGHTED № SS ON 
Tens ох CRITERION: PRoMPTING VERSUS CONFIRMATION 


Criterion Test 1 


Criterion Test 2 


Retention Test 1 Retention Test 2 


Measure 
Prompting | Confir- i Confir- i 
= - | mation mS ооа Prompting uM Prompting mation 
Writing 368 | 380 | 
aming t] : 719 750 427 
Nam Süle А 59 | 21 442 423 468 
ting the rules 17 e | i E 


Note.—Score = Achievement 
"ONSE X 1,000. 


Ahd ie 
of n E correlated with speed 
ance on te further analyses of vari- 
With a ti e criterion tests were done 
individual, ne-weighted score for each 
% criteric computed by dividing 
plete the on Score by his time to com- 
Wen RS: 
Ment oo The general treat- 
Scores w. cts on the time-weighted 
: Proms as follows: 
uring ea ting-Confirmation Results. 
dition — 7 the prompting con- 
confirmatio significantly superior to 
terion laste t However, on the cri- 
Significant the groups did not differ 
Con: "am and, if anything, the 
Superior lon Group tended to be 
Same fader shown in Table 3. The 
tests, eney holds for the retention 


Co я 

s tee prompting versus con- 
$vels "ud anie the three intelligence 
dores, e eiae (on all criterial 
Б numbe ing Rules and Names) 
lon Gro T of times the Confirma- 
ri the pal means exceeded those 
teu est 2 pi) Group (using Test 
me nt “Posy rici tests in 

y i io of 17 to 6 superi- 
fou x the Confirmation obe 
uring learning, on the other 


and 

Bram es rates were: Phase I, 

па ÉTmation me- Prompting = 3%, 
= 10%; rules and 


in 

n ie Prowintine 2 

Nation орып = 6%, Confir- 
= 24%. In Phase II, program 


writing—Prompting = 2%, Confir- 


mation = 5%; rules and naming— 
Prompting = 10%, Confirmation = 
44% 


b. 
2. Verbalization Results. Turning 
to the F ratios for the other variable, 
verbalization, it is found that Ss re- 
quired to write the content of the 
rules during training did not do as 
well in writing CPs on the criterion 
tests as Ss who had simply to write 
the names of the rules during train- 
ing or Ss who learned without either 
additional requirement. It should be 
noted that, as shown by the un- 
weighted means in Table 1, the Rules 
Group also had the highest variance 
on the criterion tests. (This effect was 
particularly evident with Ss of lower 
intelligence.) 

Table 4 shows tha 
the means of the three group 
writing—the criterion measure of 
est—the mean of the 
Rules Group is consistently lower 
than the mean of the Naming Group 
or the CP-Only Group. Two-group 
comparisons (Scheffé, 1959) indicate 
this difference is the basis for statis- 

in all cases. 


tical significance in | 
for further study is sug- 


e change in the ordering 


on the program-writing measure be- 
tween Test 1 and Test 2 for the Nam- 
s the CP-Only Group. 


ing Group Versu 
On Test 1 and Retention Test 1 


& if we order 
s on CP 
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TABLE 4 
Timm-WEIGHTED MEANS ON CRITERION: VERBALIZATION VARIABLE 
| Тезї 1 Test 2 Retention Test 1 Retention Test 2 
| i - = 
е аш ка S | t ш; ш мад Sy | Sl [Sie 

CP writing |426 | 314 | 397 | 753 | 624 | 793 | 486 | 356 | 463 | 412 | 385 | 521 
Ko er | 29 | 40 | 42] 39 | 55 | | | 
"o Be! oo] ac | uo | de| ar| 4 | | | | 


Achievement 


Note.—Score = - 
Note.—S Time 


X 1,000. All three-group comparisons except Stating the 


rules, Test 1, were significant (p < .05). For Naming the rules, Test 2, p < .01. 


the CP-Only Group made the highest 
scores, while on Test 2 and Retention 
Test 2 the Naming Group seemed to 
be superior. This tendency, however, 
is not statistically significant. This 
finding is being explored in a larger 
study dealing with the same introduc- 
tory content, but with more complex 
concepts of computer programing as 
well. It may be, as noted in a recent 
study by Gagné and Smith (1962), 
that verbalization of principles is not 
important except for more complex 
material. 

We should also note that the in- 
ferior performance by the Rules 
Group in CP writing was not simply 
a result of poorer learning generally. 
The Rules Group was superior to the 
other groups on the criterion requir- 
ing the writing of rules. In Phase I 
this was not significant, but in Phase 
II it was (p < .05). Thus the Rules 
Group learned the verbal materials 
better, but at the expense of not 
having learned how to use them. The 
Naming Group, on the other hand, 
answered the naming questions on 
the criterion significantly better (p < 
.05) than the other two groups and, 
consistent with the Gagné and Smith 
finding, benefited from this added 
background in CP writing on Phase 


II. 


One final finding was that the cri" 
terion measures were highly reliable 
by Hoyt's test (ef. Johnson, 19049). 
On CP writing the coefficients range 
from 79 to 89 (p « .01). 


Discussion 


The data from the prompting кя 
sus confirmation comparison indioa g 
the necessity for caution in apply 
findings established in раітей-288005 
ate or serial learning to paradigm? 
involving a highly organized, ? , 
stracted learning, such as learn” 
how to write CPs. In the latter c8** 
the material being taught is а me A 
ating set of knowledges rather tha 
mechanical rote responses um 
performs during the course itself the 
in paired-associate learning). In * S 
present study it was necessary E 
to extract from his mechanical WT i- 
ing responses the appropriate yc 
ating stimulus-response connec i 
required to handle the concep 
problem of writing. - 

Viewed in another light, the prompr 
ing-confirmation data also carry een 
plieations for the relation un ed 
error rate during learning and 800 jp 
on the criterion. The present a th 
dicate that in the present contet ning 
effect of error rate during Jo sega" 
washed out or even tended to be P 
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ti rc 
eed — to errors in end-of- 
evidence ire Other suggestive 
in a recent his relationship is found 
for trai nt evaluation of a PI course 
ipud salesmen (Basic Systems 
Nn rige 1963). The authors’ 
ror im oe the relationship of er- 
centage ‘i uring learning and per- 
Showed : Pb on the criterion test 
Was "e pearman p of —.32, which 
leve] (1 quite significant at the .05 
one-tailed. would have been for a 
With ehe ge I MI certainly agreed 
Deaton — finding. One possible 
S comb not getting a larger p was 
(NV = geris of a small sample 
Criterion and a ceiling effect in the 
tween 99 one which ranged be- 
cases, —— 0% correct in all but two 

Se : 
tion peel with respect to verbaliza- 
"-— defini 15 in a name—or in a rule 
in which we important. The ways 
either p re ask 8 to verbalize (e.g., 
his ЖЕҢ Writing, as was the case in 
a Or by oral recitation) 
Sidering Aen into account when con- 
mec ud effects of this technique. 
Written 2 requiring the use of 
tules) аса amar (the names of 
Naming one learning enabled the 
Well ag h ‘roup to learn the rules as 
hand, ive, to use them. On the other 
аз the Rul the contextual restraint 
Wiring n 8 Group was, and re- 
Preventeq mt of the entire rule, 
Clently | s from also learning effi- 
Com, 9 V to use the rules in solving 

Puter problem, 


ж 
foung ously, Gagné and Smith (1962) 
compos advantage in requiring S 
Ü. Was Din and verbalize the rules 
lon task ng in a concept-identifica- 
Case the Es However, in the present 
иеа Tules S was required to write 
ally MuR that they were contextu- 
Wire ve CBE, First, Ss were re- 
a com Papina the rule closely 
pletion format; second, if а 
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computer command were involved 
S was not simply asked the general 
principle behind the command, but 
rather he was asked to interpret a 
computer instruction with the com- 
mand applied to a specific meaning- 
ful program context. 

Requiring Ss to write contextual 
rules in this manner, in addiüon to 
solving problems, should indicate the 
value of teaching Ss to reiterate the 
concepts mediating the specific prob- 
lem-solution performance which they 
are being taught. The naming re- 
sponses that were required in the 
problem-solving performances should 
indicate the value of the use of 
mnemonics or reduced cues as an aid 
to learning the performance-oriented 
response (computer programing in 
this study). 
In attempting to account for the 
ssible advantage of having S name 
sed to repeating a 
f words comprising 
a rule as the experimenter has given 
it to him, the possibility must be 
taken into account that S will per- 
form best when he can use his own 
repertoire of verbal responses. Gagne 
and Smith provided the widest possi- 


ble latitude in this framework by 
the rule him- 


po 
the rule as oppo 


stereotyped set o 


this approach. 
of the importance 
for conceptual lea 
ther investigation. 


REFERENCES 

ANGE „ & LUMSDAINE, A. A. The effects 

pron trials and partial-correction 
procedures on learning by anticipation. 
(Research Report ATR-C14-9/61-SR6) 
Pittsburgh, Pa.: American Institute for 


arch, September 1961. 
eg и INCORPORATED. WATS: De- 


Basic SYSTEMS, т 
velopment and field testing: Report of 


ming merits fur- 


158 


ihe development and installation test 
data from a programmed self-instructional 
training course. (Technical Report de- 
veloped for AT&T) New York: Basic 
Systems Incorporated, 1963. 

Coox, J. O., & Sprrzer, M. E. Prompting 
versus confirmation in paired-associate 
learning. Journal of Experimental Psy- 
chology, 1960, 59, 275-276. 

Свохвлсн, L. J. The logic of experiments 
on discovery. Paper read at Social Science 
Research Council, Committee on Learn- 
ing and the Educational Process Confer- 
ence, New York, January 1965. (Stanford 
University, United States Office of Educa- 
tion, Cooperative Research Program). 

Ericksen, S. C. The piace of thinking in an 
ideal university. American Psychologist, 
1962, 17, 763-771. 

Gaax£, R. M., & Brown, І. T. Some factors 
in the programing of conceptual learning. 


Rosert J. Sewer anD Iris C. RorBERG 


Journal oj Experimental Psychology; 
1961, 62, 313-321. 

Gaoxé, В. М., & Ѕмітн, E. С, JR. А study 
of the effects of verbalization on problem 
solving. Journal of Experimental Psy- 
chology, 1962, 63, 12-18. 

Jounson, P. O. Statistical methods in te- 
search. New York: Prentice-Hall, 1949. | 

Hickey, A. E, & Newton, J. M. The logi- 
cal basis of teaching: I. The effect of sub- 
concept on learning. Office of Naval Re- 
search Report, January 1964, Contract 
Nonr-42 15(00). 

Scuerré, Н. The analysis of variance. New 
York: Wiley, 1959. 

Traws, В. E. The importance of problem 
heterogeneity to programed learning. (Re- 
search Bulletin 64-26) Princeton, : 
Educational Testing Service, 1964. 


(Received June 28, 1965) 


Journal of Educati 
1890, Vol, 67, No, 3, 108 103 i 


— OF EXPERIMENTALLY LEARNED WORD 
SSOCIATIONS ON THE ACQUISITION OF 


READING К 
S. JAY SA 
University of 


Pi airs of words were learned by 44 
training. They were then given rea 
ent pairs of words and tested for wo 
on the 2nd word of each word p 
word pairs were the same for 


the mean recognition score was 1.46. 


neutral treatment, where word pairs wi 
the means were 
-association training wa: 


and reading training, 
ment, where no word: 


These differences were significant (p 


tion scores were associated with faster spee 
SS, 


E. ша specialists and psycholo- 
on Satin stated that performance 
either s tasks is improved with 
zation bog or response familiari- 

e im asta (1960) has emphasized 
zation = of stimulus familiari- 
transfer y demonstrating positive 
waak e earning to seade list. of 
Pretroinin en visual discrimination 
Words umi was given with the same 
Strickland later in reading training. 
and Lefevr (1962), Ruddell (1965), 
the "me (1964) have also stresse 
ing by st or familiarization in read- 
Written ating that primers should be 
Patterns PT the same syntactical 
aving HRK use in speaking. 
leading children dietate their own 
LK Begg To as in the experi- 

achievi method, is another means 
Mm Gan Аарон familiariza- 
Doleh (196) 1963; Goodman, 1963). 
eginnin н. believes that for many 
in. eaders, the course of learn- 


1 

Thi А 
submitted Р. 15 based upon а dissertation 
t; O the University of California 


OS А: е 
тесрціте 0280168 in partial fulfillment of the 
he 


to hu 


Wi Phillip Gough, an 
od for their helpful advice and 
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air. In the 
word-association an 


ESPONSES' 
MUELS 
Minnesota 


15 graders during word-association 
ding training on the same or differ- 
rd recognition and speed of recogni- 


facilitation treatment, where 
d reading training, 
Tn the interference treatment and 
ere different for word-association 
.84 and 82. In the control treat- 
s given, the mean was 34. 
and higher word-recogni- 


« 01), 
{ recognition (p € 01). 


ds o: 


ad from experience charts 


ing to re 
consists of first memorizing the se- 
quence of verbal responses which 


he chart, followed later 
to read the words, that 
ting the verbal response 
with the printed word stimulus. 

The facilitating effect of familiari- 
zation has also been demonstrated 
with verbal tasks in which the sub- 
ject (S) is required to recognize 
flashed words. As the amount of 
visual preexposure to nonsense words 
increases, the time necessary to rec- 
ognize these words decreases when 
they are presented with a tachisto- 
scope (Solomon & Postman, 1952; 
Sprague, 1959). Miller, Bruner, and 


Postman found 
nonsense 
different or 
English, the 
proximation, 
recognition 
and Gold (1963) 
amount of informa 
containing 8 missing 
the time required to recognize 
missing word decreases. 

The positive effect of familiariza- 
tion has also been demonstrated with 
word associations. O'Neil (1953) and 
Rouse and Vernis (1963) demon- 


appear on t 
by learning 
is, by associa 


ders of approx! 
higher the orde 
the lower 
threshold, and Tulving 
note that as the 
tion in a sentence 
word increases, 
the 
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TABLE 1 


PARADIGM TO ILLUSTRATE 
EXPERIMENTAL DESIGN 


Word-recogni- 


Word-association 
tion test order 


L-associat ing trainin 
familiarization | Reading E 


Si-Fa S:-Ri m 
pLuE-dominoes | BLUE-dominoes N thumbnail 
S:-Rz S:-Rz P R: 
PLAY-Íairy PLAY-faerie faerie 
5з-Вз 5з-В‹ Re 
rAT-thumbnail | EAr-bumblebee X chequer 
8‹-Ва ScR: x Ta 
pig-bumblebee | nrG-thumbnail bumblebee 
Si-Rs Ере: Re 
BLACK-music RED-muzjik » liquorice 
Sio-Re ScRe Ri 
suy-liquorice MAKE-liquorice dominoes 
None S-R; В; 
BLACK-martial C martial 
None Sen; x Rs 
LIKE-chequer?* muzjik 


* Pronounced “checker.” 


strated that when word associates 
are tachistoscopically exposed in suc- 
cession, recognizing the first word 
aids in recognizing the second word, 
and the stronger the association be- 
tween the words, the lower the recog- 
nition threshold. 

The purpose of the present study 
was to investigate what effect ex- 
perimentally learned word associa- 
tions would have on learning to read 
the second word in a two-word chain 
when only minimal help was given 
during reading training. All Ss were 
able to read the first word in the 
chain, whereas none were able to 
read the second word at the start of 
the experiment. 


МЕтнор 


Ss in the investigation were 48 first grad- 
ers from a Los Angeles elementary school 
who were randomly selected and assigned 
to one of four levels of the design. 

A 4 X 4 Latin-square design was used. 
Each S received four treatments in succes- 
sion. For each level of the design, the 
order of treatments and the word associates 
were counterbalanced. 


S. Jay SAMUELS 


As seen in Table 1, in the facilitation (F) 
treatment, the word associates learned dur- 
ing familiarization were S:-R: and S-R:. 
During reading training, S:-R: and SR: 
were presented. 

In the interference (I) treatment, S-Rs 
and SR. were learned. During reading 
training, SR. and S,-Rs were presented. 

In the neutral (N) treatment, Sa-R« and 
Si-Re were learned. During reading train- 
ing, S;-R« and So-Re were presented. 

In the control (C) treatment, no word- 
association familiarization was given. Dur- 
ing reading training, S—-R; and Ss-Rs were 
presented. 5 

Ss worked individually with the exper 
menter (Е). They were first pretested 0D 
all the stimulus and response words use 
in reading training, then given word-assocl- 
ation familiarization on six pairs of words, 
then reading training on eight pairs of words, 
and a final test of word recognition. 


Pretest 


To test the effect of word associations 
on learning to read the second word in es 
word pair, it was essential that Ss be 2 d 
to read the first (stimulus) word pub 25 
the second (response) word at the start A 
the experiment. Therefore, cach S was A 
tested on eight stimulus and eight respon 
words used in reading training (see e 
1) The stimulus words were at the P' Е 
primer level, whereas the response wor! t 
were from the Thorndike-Lorge (1944) lis 
of words which appear less than once JP 
million times in print. А 

АП visual stimuli used in this exper? 
were typed with a primary-style typewr 
on 5 X 8-inch index cards. sg the 

The pretest consisted of presenting á 
cards one at a time to S for 20 seconds pon 
asking him to read the word typed OD ti 
card. If he failed to read any of the ЯШ eo 
lus words, ог if he read any of the resP? 
words, he was eliminated. 

Four Ss were eliminated, one fro 
level of the design. 


ment 
‘iter 


m each 


Word-Association Familiarizatio" 


к" fa- 
After the pretest, word-associatio t je 1, 
miliarization was given. As seen 1n 89 eth 


S learned six pairs of words in succer : 
For the first pair, E and S alternately 8^5 
the word pair five times. For examp LUE 
said, "nLuE-dominoes," then S said, P the 
dominoes.” Then Æ alternately gave ord 
stimulus word and S gave the response » and 
five times. For example, Æ said, “BLUE ох 
S said, “dominoes.” Following this, the 


Wonp-AssocrarroN TRAINING AND READING 


ai " 

M uos was learned in a similar man- 
begun. During the re of the first pair was 
the stimulu E Ue ТӨӨ E alternately said 
Word five cca and S gave the response 
first word пев After the review of the 
learned, The wi the third pair of words was 
of the six vem md и continued for each 


Reading Training 


F © A 
ing Rey familiarization, reading train- 
each ер a in two stages. In Stage 1, 
card атайы inis was presented once, on a 
tion E tea to Figure 1А. At each presenta- 
the card "s ре pair of words at the top of 
at the top * then pointed to the word pairs 
Pointed аш read them. Following this, 8 
at the bott he multiple-choice word pairs 
Pairs at 000 which matched the word 

After 8; top and read them. 
elp from peed the cight pairs of words with 
Stage, at no’ Stage 2 was begun. During this 
9. As қос о time did 2 read any words for 
three 5000 іп Figure 1, Parts А, B, and C, 
Cards Were ed kinds of reading-training 
Structed t presented in sequence. S was in- 
the card a ponte to the words at the top of 

atching па read them, then to select the 
them "6 Words at the bottom and reac 


For 

each wor AS 

80 that if S word pair, six cards were used 
was able to read the words, he 


Oral 

response 

"or e cr the response word alone. 
Cards like Fi treatment, seven additional 
te Same nu apes 1A were used to provide 
quting P of opportunities to respon 
during wor ae training that were misse 
d-association familiarization. 


Wor, 
se Recognition Test 
med 
* Word italy following reading training, 
ШЕ терер на test was given on the 
jaming Were d words. Two measures of 
tee Tesponses used—number of correct read- 
i, Or each | and speed of recognition. The 
re test c treatment was 40 seconds long. 
Ne at a ti Onsisted of two words, presented 
1me for 20 seconds. 


RESULTS 


aber 
7 of Correct Reading Responses 


iseen з 
o Me a 2, the mean num- 
the word reading responses given 
-recognition test was .34 
treatment, .82 for the N 
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Card 1 Cards2and3 Cards 4 
5,and6 
o о о 
LIKE chequer| |LIKE chequer| | chequer 
like money cheese chequer 
like cheese chequer money 
like chequer paper paper 
like paper money cheese 


A B C 
Fra. 1. Examples of three kinds of cards 
used in reading training. 


treatment, .84 for the I treatment, 
and 1.46 for the F treatment. 

The analysis of variance for re- 
peated measure Latin squares (Winer, 
1962) disclosed that neither the order 
of treatment presentation nor the in- 
teraction between treatments and list 
was significant. The main treatment 
effect of word-association familiariza- 
tion on learning reading responses Was 
significant (F = 25.47, df = 3/120, p 
« .01). 

The Newman-Keuls tests on pairs 
of means (Winer, 1962) indicated 
that on the word-recognition test the 
F treatment was superior to the I 
treatment (p < 101), the N treatment 
101), and the C treatment 


ment (p < 

p < 01). The N and I treatments 
were both superior to the C treatment 
(p< 101). The I treatment was not 
significantly different from the 
treatment. 

Speed of Recognition 


able 3, the mean recog- 
ds for the two 


34.66 for the 
28.29 for the N treat- 
r the I treatment, and 


he F treatment. 
The analysis of variance for re- 
eated measure Latin squares (Winer, 
1962) disclosed that neither the order 
of treatment presentation nor the 
word-association list used nor the in- 
teraction between treatments and list 
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TABLE 2 
Mean NuwBER or Correct READING RE- 
SPONSES AND STANDARD DEVIATIONS ON 
THE Worp-REcoGNITION TEST 


Treatment 


Control | Neutral | nter- | Facilita- 
M .84 .82 .84 1.46 
SD .30 .68 .80 .69 


was significant. The main treatment 
effect of word-association familiariza- 
tion on speed of recognition was sig- 
nificant (F = 23.01, df = 3/120, p < 
.01). The Newman-Keuls test on pairs 
of means (Winer, 1962) indicated that 
speed of recognition was faster for the 
F treatment than the I treatment (p 
< .01), the N treatment (p < .01). 
The recognition speed was faster for 
the N treatment than the C treatment 
p < .05), and the I treatment was 
faster than the C treatment (p < 
01). The speeds of recognition for 
the I and N treatments were not 
significantly different from each 
other. 


Discussion 


The results of this experiment in- 
dicate that experimentally learned 
word associations can affect the ac- 
quisition of reading responses. The 
highest word-recognition score and 
fastest speed-of-recognition score was 


TABLE 3 
MEANS AND STANDARD DEVIATIONS 
IN SECONDS ron SPEED 
or RECOGNITION 


Treatment^ 


Control | Neutral | inter, | Facilita- 
M 34.66 | 28.29 | 27.50 | 17.93 
SD 9.23 | 10.67 | 12.46 | 13.35 


a Each word was exposed for 20 seconds 
on the two-word test. 
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found in the F treatment in which 
the same word pairs were presented 
for word-association familiarization 
and reading training. A second find- 
ing was that even in the I and N 
treatments, where the word assocla- 
tions conditioned during familiariza- 
tion were not designed to facilitate 
reading acquisition, superior learn- 
ing occurred when compared to the 
C treatment, in which no word-asso- 
ciation familiarization was given. 

The familiarization effect may be 
used to explain why there were su- 
perior test-performance results with 
all treatments in which word assocla~ 
tions were learned. The response 
word, which was the second word in 
each of the six word pairs memorized 
during familiarization, was presente 
during reading training and on the 
word-recognition tests. These learne 
response words were available to 
either for learning during reading 
training or guessing on the PT 
recognition test. In the C treatme?” 
where no word-association familiari 
zation was given, although S W? 
told the response word once, Lg" 
was sufficient forgetting, so that He 
most cases he was unable to recall t " 
response word and consequently wa 
unable to learn to read the word use 
in that treatment. 

On the word-recognition test, à 
score for a correct reading repo 
might have come about by S readin 
the word because he had learned 3 
or by guessing. Each S had 20 ipm 
for each word on the test. If, ОЛ cs 
test, S tried to "guess" the EH 
response by going through the TeP 
toire of response words condition 
during familiarization, the probabi” 
ity of his emitting the correct 
sponse was .75. A few Ss gave in£ 
dence of following this a? 
strategy. This strategy is often je 10 
by beginning readers when unab nce 
read a word such as “ate” in à sente 


a 


evi- 


7 
Worp-Assoctation TRAINING AND READING 


Б .o 

лер The boy ate his food.” Be- 
"abits gee learning of syntactical 
cam 2 пе student emits verbs. He 
of io в number of them until one 
tane гм the context of the sen- 
gubssin hough this appears to be a 
been’ procedure, it is not random 
fugato" but cued guessing. The 
maios] pi contextual cues and gram- 
cm. RM is to limit verbal re- 
Pobabilit those which have a high 
arly n - of being correct. Simi- 
unction 1s experiment, one of the 
Provide p of familiarization was to 
Probabilit cues which increased the 
response ne his guessing the correct 
tion was needed additional informa- 
c" арра на scores 
mine the e this experiment to deter- 
could be effect of guessing. Guessing 
time re „expected to increase the 
ince nio for word recognition. 
Cores for reatments with the highest 
ociated word recognition were as- 
recogniti with the fastest speeds of 
although S, one must conclude that 
igh wor d 5 could guess on the test, 
Primari] -recognition scores were not 

uy the result of guessing. 


8 
8 


REFERENCES 


Cham ; W: Teaching primar dii 
y reading. 
Gay: Ren Ill: Garrard Press, 1960. 
в ndianapoli mon sense in teaching reading. 
оома, K ч Ind.: Bobbs-Merrill, 1963. 
nd M K. 
~~ Imi : 
lish, es in reading. Elementary Eng- 
Levey 5, 42, 639-643. 
of "d Linguistics and the teaching 
1964, 9. New York: McGraw-Hill, 


163 


Mutter, С. A. Bruner, J. S, & Postman, L. 

Familiarity of letter sequences and tachis- 
toscopic identification. Journal of General 
Psychology, 1954, 50, 129-139. 

MUERL, S. The effects of visual discrimina- 
tion pretraining on learning to read a 
vocabulary list in kindergarten children. 
Journal of Educational Psychology, 1960, 
51, 217-221. 

O’New, W. M. The effect of verbal associa- 
tion on tachistoscopie recognition. Aus- 
tralian Journal of Psychology, 1953, 49, 
333-338. 

Косе, В. О., & Vernts, S. J. The effect of 
associative connections on the recognition 
of flashed words. Journal of Verbal Learn- 
ing and Verbal Behavior, 1963, 1, 300-303. 

Ruppert, R. B. The effect of oral and writ- 
ten patterns of language structure on 
reading comprehension. The Reading 
Teacher, 1965, 1, 270-275. 

Sorowox, R. L., & POSTMAN, L. Frequency of 
usage as а determinant of recognition 
thresholds for words. Journal of Experi- 
mental Psychology, 1952, 43, 195-206. 

Spracus, R. L. Effects of differential train- 
ing on tachistoscopic recognition thresh- 
olds. Journal of Experimental Psychology, 
1959, 58, 227-231. 

STRICKLAND, R. G. The language of elemen- 
tary school children: Its relationship to 
the language of reading textbooks and the 
quality of reading of selected children. 
Bulletin of the School of Education. 
Bloomington: Indiana University, July 


1962. 

TzuonxpixE, E. L. & Токов, І. The teacher's 
word book of 30,000 words. New York: 
Teachers College, Columbia University, 
Bureau of Publications, 1944. 

тох, E., & Gor», C. Stimulus informa- 
tion and contextual information as. de- 


1962. 
(Received July 7, 1965) 


Journal of Educational Psychology 
1966, Vol. 57, No. 3, 161-168 


CONCEPT DEVELOPMENT IN PIAGETIAN TERMS 
IN RELATION TO SCHOOL ATTAINMENT' 


P. S. FREYBERG 
Massey University 


Relationships between general intellectual ability, conceptual devel- 
opment and attainment in arithmetical computation, and arith- 
metical problem solving and spelling were investigated in a 2-yr. 
longitudinal study involving 151 New Zealand children aged 6-9 yr. 
Conceptual development, measured by a 72-item objective test, was 
more highly correlated with Primary Mental Abilities Test mental age 
(r — 523) than with chronological age (r — .116). Regression analysis 
showed that the inclusion of concept-development scores with mental 
age accounted for a significantly larger proportion of attainment 
score variance in most cases. It is suggested that children's school 
performance is associated with aspects of conceptual development 
not adequately assessed by conventional intelligence tests. 


Although attention has recently 
been directed (Braun, 1963) toward 
the relationship between children's 
ability to form concepts and their 
school attainment, comparatively lit- 
tle research has been reported which 
relates school attainment to develop- 
mental levels in conceptual thinking. 
'The efforts of Piaget and his Geneva 
colleagues have been concerned 
mainly with the ways in which con- 
ceptual thinking develops, rather than 
with the educational implications of 
ontogenetic changes in conceptualiza- 
tion. 

Piaget’s hypothesis of discontinu- 
ous stages in concept development is 
now comparatively well known, if 
not widely accepted, in English-speak- 
ing countries. Detailed expositions 
have been given by Hunt (1961) and 
Flavell (1963), and in translations of 
Piaget's (particularly 1950, 1952a, 
1952b) own works. It is sufficient to 
refer here to Piaget's contention that 


1This study is part of a larger longitudi- 
nal investigation of cognitive development 
in children aged 6-9 years, supported by a 
research grant from the New Zealand Uni- 
versity 
of A. E. Fieldhouse 
Wellington) and the 
concerned is gratefully acknowledged. 


profound changes in children’s 0a- 
pacities for conceptual thinking occur 
at about the ages of 7-8 and 11- 
years, He maintains that before n 
age of 7 years children cannot thin 
“reversibly” and so are not able 19 
form adequate concepts of classes pi 
lationships, and numbers, and «e 
until the age of 11 years or 50, yi 
concepts which they do form are А 
related to concrete situations that 
they cannot handle abstract PTO” 
lems. Although he recognizes the p 
fluence of environmental factors ^ 
the ontogenesis of conceptual thin A 
ing, his main emphasis has been 0 
maturational determinants. 

Some other research workers (e.& 
Bruner, 1960; Churchill, 19584) 
1958b) have suggested, however; 
it may be possible through ds 
of appropriate educational SR. 
to speed up the rate of concept hen 
velopment, and that this might Len 
allow the earlier teaching of imp? 
tant educational skills and К that 
edge. The assumption is made ar- 
some skills taught in school—P" ү 
ticularly those requiring abane, 
verbal thinking—cannot be atta e 
until the appropriate level 01, yë Я 
ceptualization has also been achie ple; 
While this assumption is reasona 
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Concept DEVELOPMENT 


th 
нн of the dependence of school 
Tight e on prior concept develop- 
Binet. S not yet been adequately ex- 
aoe “perimental studies which in- 
gation E need for further investi- 
(1961) have been those of Dodwell 
Consuet Hood (1962). Dodwell 
number ed a standardized test of 
respon du ey cor- 
and ordin a seriation, cardination, 
Ub with а 1on—along Piagetian lines, 
Possible greater objectivity than is 
methods b Piaget's "clinical" 
retest) о Aort-term stability (test- 
^ рабил with an interval 
r= 8], Wh was reasonably high at 
garten we ics the scores of 34 kinder- 
Pared wit} ren on this test were com- 
leg in 1 their subsequent perform- 
achieveme Simple test of arithmetical 
ater, th nt at Grade 1, 6 months 
Hood. correlation was .59. 
cept dep devised tests of con- 
lon of n opment involving conserva- 
Sbondenge СС and quantity, corre- 
compositio seriation, and additive 
ain Sirti: of classes and numbers, 
Periments, ar to some of Piaget's ex- 
Scores of 7 Hie compared the total 
With teac A children aged 4-8 years 
ical skija з ratings of the arithmet- 
stibstant a] their pupils and found 
*. Most al degree of correspond- 
1ш in the E those who were success- 
9 be maki lagetian tests were found 
ng reasonable progress in 


Dum 

insta work, while there was “no 

failure P Success in arithmetic and 
und in tl the Piaget tests being 
U 16 same child [p. 284]." 


nfort 
> t ese, UE the results of both 
Ardeq Vestigations can be re- 


Y on T 

к; hi as starting points: Dod- 

aoe 1, snd a of subjects was too 

€ ES of Tood relied on teachers’ 
her arithmetical attainment 


T e tha 
he Purpose on actual performance. 
of the present study, 
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therefore, was to investigate in 
greater depth the relationship be- 
tween children's level of concept 
development and their school attain- 
ment in three areas—skills in arith- 
metical computation, in solving arith- 
metieal problems, and, in the verbal 


field, in spelling. 
MzrHOD 


Subjects 


The subjects (Ss) of the investigation 
were 151 children (64 boys, 87 girls) from 
four schools in a relatively prosperous New 
Zealand inland urban center with a total 
population of about 42,000 people. Although 
socioeconomic differences between the areas 
in which each school is situated are not 
marked, such differences as did exist were 
taken into account in order to avoid bias in 
the sample. As some of the tests involved 
reading of answer choices (which were, how- 
ever, always read aloud to the testees), 
children who were, after a year’s schooling, 
complete nonreaders were excluded from the 
sample. The sample may thus represent a 
distribution slightly curtailed at the lower 
levels of intelligence, although the range of 
IQs was 77-133, with a mean of 104, on the 
Primary Mental Abilities (PMA) 5-7 year 
test (United States norms). 

The chronological age (CA) range at in- 
itial testing was 5 years, 9 months to 7 years, 


10 months, with a median of approximately 


6 years, 9 months. An experimental group of 
80 children was tested at three-monthly 
intervals over 2 years, while the control 
group of 71 children was tested only at the 


beginning and end of this period. 


Tests and Procedures 
jective test of concept de- 
first devised and evaluated 
n categories, with 


A 72-item ob. 


velopment was . 
(Freyberg, 1964). Hightee i 
four items to each category, included tests 


of conservation of quantity and weight, of 
dence, of additive com- 


and of con- 
pace, speed, age, kin- 


quired the chec. n 
both questions and answer 


d to Ss. 
For the age range concerned (6-9 years 
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TABLE 1 


INTERCORRELATIONS AMONG VARIABLES AT 
Finsr Test PERIOD 


чаны; | м& | Grew | RR 

CA 276 | .116 | .127 

MA (PMA 5-7) .923 .462 
Concept-develop- 

.418 


ment scores 


Note.—N = 151. 


inclusive) the test-retest reliability with 1 
week's interval was .911 (N — 99), and the 
internal consistency coefficient (split-half 
corrected) was .869. The validity of the ob- 
jective test as а group instrument was as- 
sessed by comparing the results from 17 
items with the scores obtained on corre- 
sponding Piagetian tests in a clinical situa- 
tion. The correlation was .722 with N = 101 
The attainment tests consisted of a 
speeded computational test of 120 items in- 
volving addition, subtraction, multiplication. 
and division (retest reliability 928, N = 
70); a 25-item test of arithmetical problem 
solving (reliability .887, N = 68); and 
22-item dictated-word spelling test (т i 
E m - ce A conventional pao’ 
of intelligence 
5-7 and 7-1, was dete m 
А e concept develo i 
intelligence fea were кор, ge p^ 
initial test period to all Ss, who were th n 
allocated within each school to perimental 
or control groups so as to secure sampl 
matched for intelligence. Differences e 
tween groups on the other variables proved 
to be statistically insignificant. The tests of 
computation and arithmetical problems were 
introduced into the test program midwa; 
through the series for the еса 
group, but were taken by the control group 
only once, that is, at the final test period. 
At the end of 2 years, owing to transfers 
away from the schools and absences at the 
final test administration, the number of Ss 
had dropped to 124. Significant practice ef- 
fects were observed in the experimental 
group with the tests of spelling, concept 
development, and intelligence, but not with 
the arithmetical tests. In examining the 
predictive relationships between concept de- 
velopment and arithmetic skills it was thus 
possible to utilize the results of both con- 
trol and experimental groups, but the con- 
current relationships at the final test period 
had to be determined from the results of the 


control group alone. 


P. S. FREYBERG 


Relationships between the variables were 
assessed by means of product-moment cor- 
relations, the significance of the contribution 
of concept development being estimated 
through the caleulation of multiple-correla- 
tion coefficients and regression analysis O 
variance. 


RESULTS 


The intercorrelations among vari- 
ables at the initial test period are 
given in Table 1. It will be observed 
that the correlation of concept scores 
with mental age (MA) is consider- 
ably higher than it is with CA. The 
CA range was not in fact so Te- 
stricted that it would have effec- 
tively prevented the demonstration 
of any significant, relationships; the 
youngest child was at that time ар 
5 years, 9 months, the oldest 7 years; 
10 months, and the quartiles were 
years, 5 months and 7 years, 1 month. 
The correlation of concept scores 
with CA when MA was held constant 
(r = .08) was not significantly differ- 
ent from zero, whereas the correlation 
between concept development an 
MA with CA held constant W^ 
r= .52. 

The multiple regression of concept 
scores and MA together on spelling 
scores produced a coefficient of T 
.504. Regression analysis of varianc? 
showed that the inclusion of conceP 
scores with MA accounted for 4% 0 
variance over and above the 21% 
accounted for by MA alone, 2 $ A 
but statistically significant propo", 
Eu (F = 820, df = 1/148; P 

The relative predictive value of 
tests of general intellectual oer 
and concept development 8$ ter 
arithmetic attainment 2 years u^ 
was concerned is demonstrate ре 
Table 2. The differences betwee? re 
first-order correlation coefficients the 
statistically nonsignificant, igh, 
coefficients are only moderately ior 
as is usual with predictive cale" 


Concept DEVELOPMENT 


Biss TABLE 2 

R 

gie BETWEEN COGNITIVE VARIA- 
AT INITIAL TESTING AND AT- 
TAINMENT 2 YEARS LATER 


Variable Arithmetic | Arithmetic 
computation? problemsh 
M 
МА (РМА 5-7) 543 557 
C cept scores 524 “571 
p MA/Con- US - 
Ent .612 .647 
^N = 121 
"N = 124. 
tions ; 
at this age level. Nevertheless, 


regressi 
р analysis ої variance 
Cases, c out the fact that, in both 
ina ee утте of concept scores 
to the on to MA added significantly 
he нш power of MA alone. 
are sh ative variance contributions 
т in Table 3. 
he ie gd data relating to 
making i. ion of spelling attainment 
trol grow se of the results of the con- 
Bree of р alone showed a smaller de- 
Po qu gian (r = Hur ЮК MA, 
Tees P ае Multi- 
nificant 58108 failed to produce а S18- 
ч. езе in predictive power. 
a DM quina was made oi the 
Nitive eee ia sian between cog- 
he final t attainment variables at 
Tesults cn period, again excluding 
ecause "e the experimental group 
tests, The practice effects in some 
Scores д e consideration of concept 
o of CELER for an additional 
for MA ariance, over and above 5296 
arithmeti val a multiple regression on 
significant problem performance— 
Scores — the .01 level. Concept 
ion to the no significant contribu- 
arithmet e variance in the case of 
ical computation or spelling. 


The Discussion 
Se r 
for iis a esults provide confirmation 
North ця of such British and 
анар ТО research workers as 
r (1955), Beard (19608, 
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1960b), Elkind (1961), and Feigen- 
baum (1963) that concept develop- 
ment is more closely linked to the 
growth of general intellectual ability 
than just to CA (or general matura- 
tional level). It is interesting to ob- 
serve that the reported correlations 
of concept development with MA 
have in general been higher than 
those with IQ, as one would expect if 
CA in fact played a lesser role than 
MA. At the same time it must be 
remembered that, taking the whole 
range of ontogenetic development 
into account, MA and CA are closely 
associated, and this would account 
for the (chronological) “age-related” 
sequence in concept development 
which has been observed in many 
other studies. In other words, MA and 
CA are only rarely independent vari- 
ables, except within a restricted age 
range, say less than 1 year. 

The fact that MA and concept- 
development scores show only a 
moderate degree of association does 
not appear to be simply the result 
liabilities or of the use 
of a comparatively narrow age range. 
s noted that some 


of his 55, both normal and intel- 
lectually re marked 
discrepancy b 
level of number-¢' 
ment, and that in no case 
at a low stage of conceptual develop- 
ment reveal high arithmetic ability, 


TABLE 3 
10N ANALYSIS or VARIANCE OF 


REGRESS! 
ARITHMETIC ATTAINMENT 
Arithmetic Arithmetic 
Source computation problems 
gcc S si [| 
MA .295 .310 
t scores 
be .080 .109 
F 16.00*** 21.80*** 
wt p < 001. 
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no matter what his MA. The present 
results are in agreement with this 
finding. 

It could be argued that the com- 
paratively moderate degree of associ- 
ation between MA and conceptual 
development results from the fairly 
low “conceptual” loading of the PMA 
Test. The same comment could be 
made regarding most of the group 
tests of general intellectual ability in 
common use at the present time, 
however, and this may well indicate a 
deficieney which should be remedied. 

It appears, then, that while the 
patterns of development of concep- 
tual thinking and of general intellec- 
tual ability may be similar, these pat- 
terns do not necessarily coincide. 
Moreover, children's school perform- 
ance seems to be associated with 
aspects of conceptual thinking which 
are not adequately assessed by con- 
ventional intelligence tests. Without 
further exploration it is not possible 
to say definitely whether faster con- 
cept development contributes to 
higher school attainment or is a re- 
flection of it, or, in fact, whether both 
are the observed result of the opera- 
tion of some third factor. But it can 
be stated, with assurance, that this 
relationship will repay further de- 
tailed study, and that it may well be 
one of the keys to future improve- 
ments in teaching programs. 
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Prais 
ear ee reproof on a classroom achievement test were adminis- 
university female students, both high and low achievers. 


ты of praise and тергоо 
ance on an independent variable. Results in 


f were evaluated in terms of per- 


dicate that praise 


has : "а> 
more generalized positive effects on performance than reproof. 


pad UEM 


eria studies have been con- 
and. тЫ 9 assess the effects of praise 
formant X on learning and per- 
inr. uM (1941) obtained 
ance mu effects between periorm- 
and ia 20 while Child 
impares (1959) emphasized the 
factor in e of level of aspiration as а 
lock’s (1 suecess and failure. Hur- 
praise зү 925) results indicated that 
condition clearly the more effective 
ance of b я improving the perform- 
achievin oth high-achieving and low- 
ieee en yet an obvious 
Gin, ae Bact Van De Riet’s (1964) 
dents à that academically low stu- 
denied rs perform better when 
While nor and worse when praised, 
dents w. mal or high-achieving Stu- 
Praised шя perform better when 
when р n suffer lower achievement 
а терруьа. 

k^ investigations (e.g. Steven- 
Knights 61; Stevenson, Keen, & 
acteristic, Ее had common char- 
Dlication TA suggested limited ap- 
Procedur of the results for classroom 
ment of 3i (cf. Hartup, 1965). Assess- 
Proof has he effects of praise and re- 
Specific 5 awaya been measured on à 
itio yf o oral task with no indi- 
hese R generalized effects of 
vation to s of reinforcement. Moti- 
In these an E the experimental tasks 
Deen БЕГЕ теў studies appears to have 
Sin dnos z limited by the reduction 
ossing m olvement in such tasks as 
Material arbles or learning nonsense 
Prior iny Another characteristic of 
estigations in praise and Te- 


Son, 


proof focused on the individual ad- 
ministration of the criterion variable 
to each subject (S) ; since the experi- 
menter (E), alone, was present with 
each S, no dyadic or triadic relation- 
ship could contribute to the variance 
of the praise and reproof reinforce- 
ment. Hurlock (1925) differed from 
other investigators in that the class- 
room became the focus of her research. 
Nevertheless, the experimental vari- 
able was an arithmetic exercise, and 
the children were brought individ- 
ually to the front of the classroom 
for praise and reproof; the effects 
were specifically measured by re- 
peated performance with the same 


variable. 
The design 
parts from previo 
area: Primarily, 
variable was manipu 


of the present study de- 
us research in this 
the praise-reproof 
lated in the con- 
d achievement 
test covering terial in 
а university 
reproof Were 
extreme scores and specially prepared 
remarks by the professor. Second, 
the achievement test was adminis- 
tered after the students had spent 
some time jn the course so that some 
foundation W ‘lable to the stu- 

sessing related levels of 

or aspiration. 


given in 


be made regarding the gen- 


eralized effects of praise and reproof. 
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From Hurlock’s (1925) and Van 
De Riet’s (1964) results, two hy- 
potheses were formulated for the pres- 
ent experiment. The first hypothesis 
corresponds with inferences from Hur- 
lock’s study: Hi—Praised students 
will perform better than reproved 
students. The second hypothesis cor- 
responds to inferences made from 
Van De Riet’s study: Hs—hReproved 
low-achieving students and praised 
high-achieving students will perform 
better than praised low achievers and 
reproved high achievers. A third 
hypothesis was also formulated uti- 
lizing Van De Riet's results within a 
framework of the level of aspiration 
(e.g, Child & Whiting, 1959): Н;— 
Praised students who estimate that 
their achievement scores will be rela- 
tively high and reproved students who 
estimate their achievement scores will 
be relatively low will perform sig- 
nificantly better than reproved stu- 
dents who estimate their scores to be 
relatively high and praised students 
who estimate their achievement to be 
relatively low. 


METHOD 


Variables 


Two tests were used in the experiment. 
The praise-reproof variable was an objective 
4-choice, 75-item achievement (ACH) test 
based on material presented in a 3-week set 
of lectures related to evaluation and meas- 
urement. On a larger sample of 136 cases, 
the ACH test had a split-half reliability of 
.63. An item appended to the ACH test re- 
quired the students to estimate whether their 
ACH score would be in the lower, middle, or 
upper third of the class. The second experi- 
mental variable, used to assess the general- 
ized effects of praise and reproof, was a 120- 
item multiple-choice test (COMP) of 
simple addition and subtraction of single- 
digit numbers, with all subtraction differ- 


ences being positive. 


Subjects 


The sample consisted 
university students enro 


of junior and senior 
lled in three sections 
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of an introductory course of educational 
psychology; one male professor taught two 
sections, while another male professor taught 
the remaining section. The selection of stu- 
dents for experimental analyses was based 
on several criteria. In a deliberate effort to 
obtain distinetly high and low achievers, the 
data for analyses are based on 52 females, 
selected from the total number of 128 stu- 
dents, who were present for all procedures 
and who scored either in the upper or lower 
third of the total ACH-test score distribu- 
tion; absences, either from one of the 
COMP or the ACH-test administrations, re- 
duced the sample size. Moreover, since & 
limited number of males were represented in 
the upper and lower thirds of the ACH score 
distribution (9 and 12, respectively), and 
ample evidence of possible sex differences 
had been reported (e.g, Schmidt, 1941; 
Stevenson, 1961), males were eliminated 
from experimental analyses. 


Procedures 


The students were administered the 
COMP in a 4-minute session, under 
auspices of a math-test standardization pro- 
gram, 1 day prior to the administration 9 
the ACH test. The COMP test administra- 
tion was conducted by a “neutral” faculty 
member relatively unknown to the students. 
Four days after the ACH test, the students 
were given sham ACH scores. All females 
designated for the praise condition 1n 
experiment were assigned random 808 
scores greater than 65, and those designate 
for the reproof condition in the experimen 
were assigned random sham scores O less 
than 20. The sham scores were written РЕ 
the inside of slips of paper that were folde 
once and stapled. The student's name T 
written on the outside, and the students were 
told periodically to refrain from апу 1 
cussion, particularly of test scores. After ps 
scores were handed out, the professor ma 
prepared remarks emphasizing th 
three students had scored much, 
(above 60) than anyone had scored in 2". 
than 5 years of administering this test» 
one or two others had scored well 
chance (below 20): “...so you can 806». 
am at the same time very elated and som. 
what embarrassed for a few of you 1) © ie 
class." Immediately after the professor 8 2 
marks, the COMP was administered aoe 
to the students by the neutral faculty Tei 
ber. Following the completion of all tes Sd 
the purpose and details of the experim 
were explained to the students. 


ErrEcrs or PRAISE AND REPROOF 


Analyses 


Th " . 
Бу ses and inferences are based on 
The каре percentages of COMP scores. 
experiment sizes in the eight cells of the 
ual сорап much too small for individ- 
ever, were pa The three hypotheses, how- 
the cells i anned for various groupings of 
Wa hogonal conti m" 

rds, 1963, pp. 140-144). ntrasts (eg, Ed 


RESULTS 


The pr 
meee and post-COMP scores 
tively with ACH scores, respec- 
tion te 07 and —.09. The correla- 
Scores Vice pre- and post-COMP 
e — ри The COMP, then, may 
due qund as à highly сепара vari- 
ines padent of the ACH test. 
Мотвоуег, the pre- and post- СОМЕ 
66.94 eos respectively, 60.31 and 
ard devi hile the corresponding stand- 
x duas were IM and 9.86. 
Professor h sections taught by one 
0.85 a I an ACH-test mean 0 
7.25; Foe a standard deviation © 
e eie corresponding statistics for 
6.60 "maining section were 41.65, i 
significantly ( means did not differ 
and lowe y (t = .606), so the upper- 
mined np ey groups were deter- 
Students е ACH scores for all 128 
groups h Ts upper- and lower-third 
71 Mec, means, respectively, 9 
Standard q 35.11, and corresponding 
le mean eviations of 8.38 and 888; 
With р 208 were significantly different 
© 7.14.4894 (p < 001). 
Scores for remental percentage COMP 
Sented in the various groups are pre- 
artlett’s Neues 1. A conventional 
ances yie] bee on the within-cell vari- 
indicano ded a x° of 9.8455 (р < 30), 
ances, wage mogeneity among vari- 
хасу hile the sample sizes аге not 
differ , Droportional, the Ns did not 
9 6.5 x ificantly from the average 
B va 7.6923; p < .50), so the 
as used in an approxima- 


tion 
Met! 
hod of analysis of variance 
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TABLE 1 
SAMPLE SIZES AND PERCENTAGE COMP 
SconE MEANS AND STANDARD 
DEVIATIONS FOR GROUPS 


Groups 
Low | Low igh i 
Independ Е Г h 
панцер KE | ЖЕ | BOE | AR 
low high low high 
ACE ACH ACH | ACH 
esti. esti- esti- esti- 
mate mate mat mate 
N 5 5 11 6 
Praise M |114.40122.80 110.91]118.33 
SD |13.5 | 13.7 10.8| 7.7 
N 4 4 10 7 
Reproof | M 107.00/106. 50/108. 60 108.14 
SD | 5.5 | 11.4 7.8 | 11.4 
(Johnson, 1949, pp. 261-266) and in 


the orthogonal comparisons. 
The analysis of variance for among- 


cell variation resulted in an F value 
of 1.7067 which was not significant 
> .05), but this does not vitiate 
Janned orthogonal comparisons. The 
standard error for each of the three 


contrasts is: 


/(8/6.5) (133.224) 


value is evalua 


= 12.81, 
ted with 


the first hypothesis, 
groups are higher than the reproved 
кеа in = 259 (p € 


ing to 
pante rd to estimated ACH scores, 


With rega 
ihe third hypothesis states that the 
praised esti ated-high-ACH groups 
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and the reproved estimated-low-ACH 
groups will perform better than the 
reproved estimated-high-ACH groups 
and the praised estimated-low-ACH 
groups. For the third hypothesis, 
t = 1.8099, which, while not signifi- 
cant (.10 > p > .05), is at least in the 
predicted direction; the third hy- 
pothesis, however, is rej ected. 


Discussion 


The present study represents an 
attempt to measure the generalized 
effects of praise and reproof on a 
cognitive variable within the classroom 
context. A greater increment in per- 
formance was realized under praise 
than under reproof. The results sug- 
gest that praise and reproof in one 
subject-matter area in the classroom 
may well transfer, in terms of effects 
on performance, to other classroom 
behavior that is entirely distinct and 
independent. In agreement with con- 
clusions of Hurlock (1925), present 
data supported the traditional cre- 
dence that praise of student behavior 
will generally improve performance 
more than methods of reproof. 

Van De Riet (1964) obtained an 
actual reduction in performance for 
praised underachievers and reproved 
normal achievers, yet no such reduc- 
tion was obtained in the present study, 
but there may be several reasons for 
this difference. The samples in her 
study differ from those of the present 
study in terms of age, sex, and 
achievement classification; the sam- 
ples herein are merely defined as re- 
latively high and low achievers with 
respect to a single classroom variable. 
Although the effects of praise and re- 
proof were measured in both studies, 
Van De Riet focused on task-specific 
effects, while the present investigation 
was directed toward generalized ef- 
fects. Moreover, for the present study, 
students in all of the classes were 
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“contaminated” by the experimental 
procedure so there was no way to es- 
timate the average increment in per- 
centage of COMP scores without the 
intervening praise-reproof variable, 
and only relative comparisons are pos- 
sible in the present study. 

The level of academic expectancy 
was a unique variable that had not 
been considered in many previous 
studies of praise and reproof. Success 
and failure, as might be constituted 
in praise and reproof, are generally 
recognized as having differential 
effects (e.g., Child & Whiting, 1959), 
and Kunst (1959) has even postu- 
lated that low achievers have a “need 
to fail. The results of the present 
study indicate that the level of aca- 
demic expectancy may be an impor- 
tant factor in terms of reinforcemen 
and performance. From an examina- 
tion of the COMP means in Table iy 
it is apparent that praise and reproo 
have the largest differential effects on 
those students (both high and low 
achievers) who estimated that they 
had maintained relatively high aca- 
demic performance on the achieve- 
ment test. Such differential effects, 
however, were not so pronounced f0! 
students who estimated their ре?" 
formances to be relatively low. These 
differences in performance were 09 
significant in the present study, but a 
more thorough study of such differ- 
ences might prove profitable in fut"! 
research. 
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EMOTIONALLY LOADED WORDS AND THE ACQUISITION 
OF A SIGHT VOCABULARY 


DAVID R. OLSON! anb А. S. PAU 
Dalhousie University 


A group of 33 primary children were taught to read a list of 6 words, 
3 of which were empirically classed as high-emotional words, and 3 
classed as low-emotional words, in 2 independent experiments. The 
subgroups of words were matched for length, frequency, and gram- 
matical function. Training was done by the anticipation method with 
correction after each trial. Children required significantly fewer trials 
to learn the high-emotional words. This finding is predictable both 
from psychological theory and from the observations of insightful 


teachers such as Ashton-Warner. 


An observation of some significance 
to education has been made by Sylvia 
Ashton-Warner (1963) in her recent 
book, Teacher. She observed that 
Maori school children were often 
completely impervious to the static 
Janet and John reading program. 
She found that she was able to solve 
the reading problem by selecting for 
each child a set of words of intense 
feeling and meaning; they quickly 
learned to read such words as ghost, 
fight, kiss, and mommy, presumably 
because they reflected the basic needs, 
fears, and concerns of the children. 

Her views have a strong affinity 
with the writings of John Dewey on 
the role and nature of interest in 
education. Dewey (1913) argued that 
it is impossible to first select subject 
matter and later make it interesting. 
Subject matter must be selected in 
terms of the child’s urgent needs and 
interests. 

The purpose of the present study 
was to recast this view into a psycho- 
logical model and to verify it em- 
pirically. What are the effects of 
emotional loading on the acquisition 
of a sight vocabulary? This problem 
may be considered in terms of the 
interrelationship between motivation 
and cognition. Extensive critical re- 


1 Now at the Center for Cognitive Stud- 
ies, Harvard University. 


views of the experimental studies coD- 
cerning this relationship have been 
presented by Allport (1955), Chaplin 
and Krawiec (1960), and Henle 
(1955). Representative of the studies 
reviewed was that of Bruner a? 
Postman (1948) who had subjects 
(Ss) judge the sizes of disks inscribe 
with either positive, negative, ог ue 
tral symbols. The  positive-value 
symbol (a dollar sign) was JU ge" 
larger than the negative-valued sym 
bol (a swastika), but both of these 
were judged as larger than the po 
tral one. They concluded that what 
ever is important to S, whether m 
tively or negatively, was likely to а 
exaggerated in his perception. Tol- 
concept-identification situation, d 
son and Harper (1963) presente 
evidence that children do not respon. 
exclusively on the basis of cognitiv, 
input; the reactions were modifie 7 
the motivational significance of E {- 
material to be learned. Hence, 0 oA 
ferent contexts, emotional significant” 
or affect has either a positive a 
negative influence—it is rarely, PO 
ever, irrelevant. 

Another line of eviden 
from the studies of factors aff s n5 
the formation of verbal associato - 
Meaningfulness has repeatedly pk 
shown to facilitate this process” 


ce comes 
ectin£ 


Jen- 
* For example, Palermo, Flamer, and 
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Learning Emorionatty Loapep Worps 


cen 
эы з (1962) has shown that 
eem ity is a factor independent 
айп ingfulness as defined by asso- 
with hs ud His finding is in line 
study. K position developed in this 
он goes оп to speculate that 
communi ity may act as "noise in à 
tion eee channel,” a specula- 
other studi y at variance with the 
Ray s les summarized here. 
theory ge of this paper, the 
Perhaps y be stated thus: Except 
stress iie occasions of extreme 
loading T affective or emotional 
accessible i stimulus makes it more 
ing a set о the organism by provid- 
(1955, p ie expectation. As Henle 
under the i 6) stated, “It seems that, 
ude, we influence of a need or atti- 
which ч are attuned to events to 
sensitiva Ж, not otherwise be 
tive influe his set will have а posi- 
18 centra] y when the cue activated 
Influence 1 d the task, and a negative 
tracts «e attention to this cue de- 
of the is е the genuine purpose 
Vigilance» ‚ Since, in our study, any 
loaded m toward the emotionally 
task, the aterial was relevant to the 
tional i S tiom followed that emo- 
Ore aou facilitates learning. 
that kin 3» cally, it was hypothesized 
Shi е children learn to 
iind Which have an emotional 
Keri ess quickly than neutral 
onal words. 


Signi 
or n 


king 
(19 
nonsense ЫП Moned that high-meaningful 
i ociate task les were learned in a paired- 
n ul nonsen more easily than low-mean- 
Sade children. syllables for both fifth- 
s parent anui апа adulta: Although the 
[s i Meanie tion, is somewhat bother- 
xis he ers ulness is obviously relevant 
aningfulnens i study. The definition of 
uj ations ss in terms of the number of 
lous given per minute provides the 


f a wo ОЗЫ 
а “ye d a A that the meaningfulness 
basis се” basis increased by paying Ss on 
. » rather than ор an hourly 
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METHOD 


Design 

The overall design of this study involved 
teaching a group of primary school children 
to associate a phonological reading re- 
sponse with each of a set of visually pre- 
sented words, one-half of which were Emo- 
tionally Loaded and the remaining one-half 
Nonemotionally Loaded or Neutral. The 
method employed was the anticipation 
method with correction after each response. 


Subjects 
were 9 boys 


Ss for this investigation 
and 9 girls from the Primary Grade at Le- 
Marchant School, Halifax, Nova Scotia. All 
children were either 5 or 6 years of age, 
and none of them could read any of the 


words used in the experiment. 


Materials 


The materials 
consisted of & lis 
classified as either 
or Nonemotionally (N-E 
structing this list, 4 
jectives were selecte 


Lorge (1944) word li: 
that they be words having frequency rat- 


ings of 100 or more per million, and suita- 
ble for children at the primary Jevel. These 
46 words were presented to 30 sixth-grade 
children who were to classify them as: (@ 
“Words I love"; (b) “Words I hate"; or (c) 
«Words that don't matter." Only those 


words which consistently fell into the same 


category Were used in the 
in the first two categories 8 
E words, and those in the third аз N-E. 
The categorization of these words was then 
cross-validated Leer them to а sm 

group of first-grade children к 
sort these into the same three с asses. This 
procedure results obtained 


duplicated the 

by the earlier method. Au 
On the basis of pilot trials, it was con- 
list of six four-letter words 
for primary children. The 
d in block letters on 3x 
f the words was random- 
ized; er, to inimize the effects of 
ized; ft hildren's learning, two sets of 
set randomized inde- 


used in this experiment 
t of words which were 
Emotionally Loaded (E) 
) Loaded. In con- 


which coul n be presented in & cyclic 

;dren. The words utilized 
h the percentage of 
— 30) who rated them as 


children 
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TABLE 1 


Trias REQUIRED FOR LEARNING OF 
EMOTIONAL AND NONEMOTIONAL 


Worps 
Emo. | Trials 
Class & word tional | ———4 * 
rating | M | SD 
Emotional | 
GOLD 90% | 3.6 
BURN 93% | 3.9 
KILL 96% | 4.4 
Combined mean 3.7 
Total 11.9,3.61 
Nonemotional 
HIGH 13% | 6.0| 
MANY 3395 | 5.6 5.09* 
HALL 3695 | 8.8 
Combined mean 6.8 
Total 20.46.38: 


*p < 001, df = 15. 


Emotional are presented in the first two 
columns of Table 1. 


Procedure 


The 18 children employed in the study 
were individually tested by the experi- 
menter (E) in a private room in the school. 
As soon as a reasonable degree of rapport 
was established, the child was seated at a 
table facing E. Ё presented the word cards, 
which had been randomly ordered, one at a 
time in the palm of his hand. As each word 
was presented the child was asked to read 
the word; if he could not, Æ pronounced 
the word and requested that the child re- 
peat it. E simultaneously recorded the read- 
ing response (right or wrong) on a form 
provided. The first time through the list of 
words, Æ also asked if the child knew what 
the word meant. Virtually all the children 
used in this study could use the words in 
sentences; on the occasions when they could 
not, E provided an explanation until the 
child signified some understanding. 

Words were presented in a cyclie fashion 
with no stops between the presentations of 
the list. Learning continued until the cri- 
terion of three consecutive correct responses 
was given to each word in the list. Data 
for use in the analysis consisted of the num- 
ber of trials each child required for each 
word before he responded correctly to it 


for three or more trials. 
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RESULTS 


Of the results from the 18 Ss in- 
volved in the experiment, only 16 
were used in the analysis. Both of the 
excluded Ss failed to make any 
progress on the task before they 
showed signs of fatigue. 

The mean number of trials re- 
quired for the mastery of each of the 
six words employed in the experiment 
with combined means for each of the 
classes, Emotional and Nonemo- 
tional, are presented in Table 1. Since 
each S served as his own control, à 
t test for the significance of the dif- 
ference between correlated means for 
the Emotional and Nonemotional 
words was conducted. Raw scores for 
this analysis consisted of the total 
scores of individuals for E words an¢ 
N-E words. The resulting Ё is also 
presented in Table 1. An examination 
of Table 1 shows clearly that the 
children required significantly fewe” 
trials to learn to read the Emo 
tional words than Nonemotional 07 
Neutral words. There was no арра" 
ent difference between the positive 
and negative words within the Emo- 
tional class. 4 

Although the results of this exper" 
ment are striking, the factor РГО 
ducing the effect cannot be attribute" 
with absolute certainty to the inde. 
pendent variable, Emotionality, ме 
two reasons. First, it is possible he 
nouns and verbs are learned mOr 
easily than adjectives; the lis g 
words used in the experiment was A" 
controlled for part of speech. Es 
ond, two of the words in the None 
tional class begin with the letter ^" 


is possible that this redundancy om 
plicated the discrimination O these 


two words. In order to eliminate 
contingencies, the study was 
cated. 


тер)” 


LrEanNING Е х 
NG EMOTIONALLY Loapep Womps 


А REPLICATION 


The replicati 
is аа of the experiment 
with the ed in a manner identical 
children already described. The 
15 5- an -ployed in this study were 
from St, ht en primary children 
the an „поша Aquinas School in 
in which Am of Halifax as the school 
П order t first study was conducted. 
ase of d о broaden the empirical 
Dew word he conclusion, six entirely 
Cation TI were utilized in the repli- 
lected to nese new words were se- 
M the ер. the criteria established 
imitations section, as well as the 
9f words s mentioned above. The set 
along Sea een for this replication 
= 30) r percentage of children 
al are a ho rated them as Emo- 
"sm resented in Table 2. 
that s M of Table 2 shows 
Stitute nod the words which con- 
Not Overt: experimental classes did 
ords eae it was impossible to get 
onemotion extreme Emotional and 
all the ee ratings and still meet 
di кану eee discussed earlier. This 
е Type pe the possibility of 
the probani that is, it staggers 
Sis in eee against the hypothe- 
e 
e SP edd of trials required for 
Class jg "iod of the words for each 
in Taliep r presented and analyzed 
Mastery oj This table shows that the 
Significanty Neutral words required 
поба |у more trials than did the 
n ings y Loaded words. These 
after Фет the conclusion drawn 
original experiment. 


tion 


Co 
NCL 
USIONS AND IMPLICATIONS 


This 
Ene ШЕ has provided empirical 
ological hat children can learn pho- 
(ona E responses to Emo- 
a to N s more easily than they 
Sing it тапете сози words. In 80 
has provided some evidence 
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TABLE 2 


TRIALS REQUIRED FOR LEARNING OF 
EMOTIONAL AND NONEMOTIONAL 


Worps 
Trial: 
Class & word teal = t 
rating | у | SD 
Emotional 
HATE 96% | 5.5 
WISH 93% | 6.3) 
KISS 73% | 3.6 
Combined mean 5.1 
Total 15.34.33 
Nonemotional 
COME 23% | 8.9 
KEEP 43% | 8.5) 4.92* 
BUSY 56% | 9.8 
Combined mean 7.7 
Total 23.70.64 
жр < 001, df = M. 
ational 


for the importance of motiv: 
factors in learning. Furthermore, it 
substantiates what Ashton-Warner 
(and other good teachers) have known 
all along, namely, that the degree of 
learning is & function of the signifi- 


cance that the material has to ihe 
child. The study also indirectly adds 


to the validity of the concept of in- 
trinsic motivation—motivational fac- 
tors which arise from within the 
child. It leaves unanswered, however, 
the interesting question of the rela- 
tionship between meaningfulness and 


emotionality. Р 
The hypothesis tested in ihis ex- 
periment received unexpected con- 
firmation when one E, upon entering 
the school, observed 2 child emerging 
training administered by 

E. E asked the child what 
. He replied, "I 
some words." When 
he words were, the 
replied, Наќе..., 
Wish. -> and som 
don't remember." T 
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were not only more difficult to learn 
to read, but they were also forgotten 
more quickly. . 

Most children in our society, espe- 
cially in the middle class, are suffi- 
ciently motivated by their desire to 
read that they make satisfactory 
progress even with relatively neutral 
material. Nevertheless, highly emo- 
tional words may act as a bridge for 
the nonreader or the poor reader to 
enter into the reading world. The 
minimal implication of the study 
is that teachers and researchers must 
not consider children as neutral or 
unstructured intellectual machines 
passively waiting to be programed. 
The secret to gaining access to the 
child’s mind is through the use of 
words and ideas that are important or 
significant to him, that is, through 
ideas to which he has already ac- 
quired an emotional reaction. 

Further studies could be conducted 
to examine the effects of situational 
or induced emotionality on learning. 
Perhaps it is possible to vary the 
emotional loading of certain concepts 
by introducing them in a certain way. 
This would be contrary to the 
Dewey-Ashton-Warner position that 
interests must be discovered and ex- 
ploited and not merely laid on. Such 
evidence would provide some under- 
pinning for the more orthodox educa- 
tional practice of beginning a lesson 
on the prescribed material by writing 
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into a lesson plan a small section 
called “motivation for the lesson.’ 
This study has been an attempt to 
demonstrate that the relationship be- 
tween motivation and cognition 1 
an educational context is an impor- 
tant and researchable issue. 
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EFFE T 
FECT UPON TRANSFER OF VARIATIONS 
IN TRAINING CONDITONS' 


І D 
ROBERT L. R. OVERING R Г 
г АЫ OBERT М. W. TRAVERS 
1 Western. Michigan University 


AND 


An experi 
(a) сынду conducted to investigate the effect upon transfer of: 
Y К calde бы ^ ant information while holding relevant information 
аса Ц ааыа —— a set, and (c) verbalizing knowledge 
| Gand 7, were ami pun: es with an IQ range of 110-130 in Grades 
iy exch oe gned to ifferent conditions of learning. уе of the Ss 
рна eni ai were asked to verbalize the knowledge ac- 
шыгу 5 "m to solve 2 problems involving aiming at an 
training in b ages . Results suggest that transfer is facilitated by: (a) 
see selene d presence of irrelevant information, (b) establishing a 
g the material to be learned to а practical situation, and (c) 


verbalizati 
lization of knowledge prior to its application. 
ee 


Previ ' 
t бей experiments (Hendrickson 
ditate qpa r 1941; Judd, 1908) in- 
Principle в verbal knowledge of а 
he Bnet. a facilitating effect on 
Situation y of transfer evident in a 
Principle mega. applieation of that 
further sl lendriekson and Schroeder 
transfer стар that the amount of 
n ormation related to the amount of 
mation n taught, that is, the less in- 
Vena 7 the less transfer, and vice 


s, kd these previous experi- 
of iere investigated the ef- 
rent нр: the principle by 
e !éporte * In fact, in the experi- 
th 9rmed at d by Judd, we are not 
© inform a of the manner in which 
ee a making up the inde- 
lae ы эы, was taught, or, for 
ex ght. We s What information Was 
tye timental re simply told that the 
стена] group was given a full 
BoC subj explanation of refraction 
ба), owe were Grade 5 and 6 
hile the control group received 
Thi 

Sora 215 article ; 
d = is based on research spon- 
; United por Educational Media 
he toject С tates Office of Education, 
"cedures Каш Complete details of 
iso P ee can be found in а 
cludes thi on by R. L. R. Overing 

s and other studies. 
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no theoretical training. As a test for 
the effect of this knowledge on trans- 
fer, all subjects (Ss) were then tested 
in a series of tasks requiring them to 
throw darts to hit an underwater tar- 


get. 
Hendrickson an 


1941 experiment, W 
fied reproduction of Judd's, do de- 
scribe both the method by which they 
taught the information and the infor- 
mation itself. Essentially, each S was 
presented with a brief printed state- 
ment and a diagram representing 
(somewhat inaccurately) the refrac- 
tion of light in water and the resulting 
apparent displacement when an under- 
water object is viewed at an angle 
from above. Each S was permitted to 
study this information until he stated 
that he understood it. As in Judd's 
experiment the control group received 


no instruction or information relating 
displacement. Like 


to refraction OT 
Judd's, also, the transfer task again 
was to attempt to hit an underwater 
target, only in this case an air rifle 
and pellets were substituted for the 
darts. os . 
The experiment which is described 

was an attempt to ex- 


d Schroeder in their 
hich was a modi- 
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what the effect upon transfer might be 
of introducing variations in training 
conditions, where relevant information 
and time were held constant. The di- 
mension varied was essentially that of 
irrelevancy; for example, in the “re- 
alistic" treatment, all sorts of irrel- 
evant cues were present, whereas in 
the "visually compressed" treatment 
an effort was made to eliminate many 
irrelevant bits of information by re- 
ducing the visual aspect of the pres- 
entation to a series of line drawings. 
'The choice of treatments arose from a 
survey of research and theory related 
to audiovisual information transmis- 
sion recently completed at the Univer- 
sity of Utah by Travers et al. (1964). 

In Travers' monograph it is pointed 
out that evidence exists to indicate 
that irrelevant information may inter- 
fere with learning (e.g., Hunt, 1962). 
This viewpoint is backed by neu- 
rophysiological and other evidence 
which suggests that the human senses 
and nervous system compress informa- 
tion into simpler forms (Attneave, 
1954; Cherry, 1962; Hubel, 1963). 
Thus the transmission of information 
should take place most effectively 
under conditions where the informa- 
tion to be transmitted has been pre- 
compressed, either by the elimination 
of redundancy and/or the elimination 
of irrelevant information. 

However, the possibility also exists 
that conditions of compression, while 
facilitating transmission of informa- 
tion, may not be optimum where 
transfer is called for. In other words, 
the optimum conditions for learning 
a concept may not be the optimum 
conditions for transfer related to that 
concept. Our research, the main de- 
tails of which are reported below, is 
an attempt to cast some light on these 

'oblems. 

r- the presumption that S's knowl- 
edge of a principle tends to raise = 
level of performance 1n à transfer tas 


Бовевт L. В. Оуєвіха AND RoBERT M. W. Travers 


requiring application of that principle, 
the experimenters set out to deter- 
mine: | 

1. Whether knowledge of the prin- 
ciple of refraction acquired under 
visually compressed or under realistic 
training conditions would best facili- 
tate transfer. 

2. Whether verbalization of the 
knowledge taught would facilitate per- 
formance on the transfer task. 

3. Whether contact with а relevant 
problem immediately preceding 12- 
struction with visually compresse 
materials would establish a set, thus 
facilitating performance on the trans" 
fer task. 


METHOD 


Experimental Design 


The experimental design was & 2 x 2 х 
4 X 2 X 12 factorial design comprising о 
Verbalization-Nonverbalization I тё- 
sexes, 4 training conditions, 2 tests, ат! 
peated measures within each test. 


Training Conditions 


Я а 

Four training conditions were introdusa 
into the experiment. The attempt was n but 
to teach all groups the same informa Онол 
to vary the way in which the inform hing 
was taught. The time devoted to ies ] in 
was the same for all groups. The ver e 
formation provided was reproduced in four 
case by means of a tape recorder. The 
teaching conditions were as follows: . con- 

1. Verbal Information Only. In thi ded 
dition Ss listened to a tape which pro" ript 
all the information they received. те ction 
described: (а) the principle of the ге oa i 
of a light ray entering and leaving "v dis- 
and (b) the phenomenon of the Y 
placement of an underwater objec! ime 
viewed at an angle from above. 
taken to run this tape became the 
used in all teaching procedures. 
of the tape also became the basis · 
materials used in all other teachin 
tions. tion 

2. Visually Compressed Informati "ma 
Verbal Information. In this ene 
dition line drawings were introdu e 
transmit some of the information а® vog. 
of the redundant items of the ec? pro" 
bal script were deleted. The drawln 


g cont 
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vided А 
M eed bem representations of the 
to facilitat iscussed, but colors were used 
the dra, a e the identification of parts of 
Every eff ngs from the verbal descriptions. 
décor was made to ensure that the in- 
eachin, content of this and all other 
3. Rs procedures was identical. 
Irma Demonstration. plus Verbal 
volved EUR This teaching condition in- 
Passing i isual presentation of a light beam 
ш Б the water in a 2-gallon fish 
on the Uu beam was reflected by а mirror 
e Pise of the tank and also could 
of light 8 as it left the water. The beam 
Strip proj as produced by а 500-watt film- 
Diner arranged to produce an ap- 
Was 25 inc parallel beam of light which 
it could f in diameter at the place where 
Prove the ХЕР be observed. In order to im- 
chalk dust visibility of the beam of light, 
eraser oy was knocked out of a blackboard 
Pupils n the tank. In this condition the 
light fp ere free to observe the beam of 
n add) different angles. 
Stration ari pupils were given a demon- 
displacem the phenomenon of the visual 
volved pe of an underwater object. This 
and S, E. identieal tanks side by side, 
added to ere shown that when water was 
Position Oba of the tanks, the apparent 
rove s a coin on the bottom appears to 
Upils we, ards and also away from them. 
Coin in ae shown how the position of & 
Changed to Parallel empty tank had to be 
relative ро make it appear to be in the same 
Partly fille "ped as a coin in the similar tank 
: View, water. 
Verbal Trt, Compressed Information plus 
9 Problem cu Preceded by an Orient- 
fing same as his teaching condition was 
St giy Condition 2, but pupils were 
the р еп a problem t Н "n 
et, Tactical implication: point out some © 
told ction. In a cations of the principle of 
Cro the story ae problem the pupils were 
к è river, Ъ of an explorer who had to 
20 crocodil ut who first had to shoot and 
Som ds feet below the surface and 
on е to indi e explorer. The pupils were 
p], the “ез on a diagram the point 
тег shoul, ^u of the water where the ex- 
d their es Pupils were not tol 
he Purpose lutions were right or wrong, 
Nome, Praetieg] was only to focus attention 
eno al problem involving the phe- 
„Бан of retraction " 
whig requi the Ss of each training condition 
ich puted to sj : 
the S! their t undertake a written test 1n 
information was to verbalize aspects © 
tion toceeded disi en: The other half of the 
8. directly to the testing situa- 


In the case of some of the groups of Ss 
run early in the experiment, every fifth S 
was assigned to a control group that was 
given no training, but proceeded immedi- 
ately to the testing situation. The function 
of this group was to determine whether 


training did produce learning. 


Test Conditions 

The tasks used to measure differences in 
the amount of transfer resulting from each 
of the four training conditions involved the 
activity of shooting at а target under 
water. The task was similar to that used in 
the previous studies of Judd (1908) and 
Hendrickson and Schroeder (1941), but cer- 
tain important modifications were made. The 
testing situation involved a mock pistol 
mounted on a stand in such a way that only 
the angle of elevation could be varied. The 
target object was immersed in a large tub 
of water, and the surface of the water in 
the tub was covered with a rigid transpar- 
ent plastic sheet on which was marked a 
numbered grid. The pupil in the test situa- 
tion did not actually have to shoot, but was 
required only to indicate the numbered line 
on the grid at which he was aiming. i 
procedure was designed to eliminate differ- 
ences in the ability to aim and shoot а gun, 

tributed substantially 

to the error term in the subsequent analysis. 

Each imilar tests 
which differe 
in the secon i 
with 5 inches), 
a more oblique viewin, 
water. In the first test the lower edge of 
the tank was 31 inches horizonta: 1 
gun stand, and the gun itself was 39.5 inches 
above the floor, but in the second test the 
horizontal distance was increased to 47.5 
inches and the vertical height to 48.75 inches. 


second test also involved & different 
келү" and was administered 


t and tub 
shaped tare" han the first test. It 
ifficult task 


structed before the tests 
reported their point of 


that each time they г i t 
aim they would be told either “hit,” “miss,” 
” They were also told that a 


or “bullseye {еу 
p meant that the. А 
ЪшГв-еуе the target, while a hit meant 


had struck some 


minated. 
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The Ss were tested individually, and, 
while each was asked not to give away the 
answer to other pupils, some effort was made 
to arrange the procedure so that the pupils 
returned to their classes after the testing 
where little opportunity to discuss the ex- 
perience was provided. In one of the early 
tryouts of the materials, leakage of infor- 
mation concerning the correct place to aim 
the gun was identified as a serious problem. 

The small group of Ss run as a control 
group without any training was given special 
instructions as follows: 

Underwater objects, when viewed at 
an angle from above, do not really lie in 
the exact position that they appear to 
lie. Knowing this fact, your task, now, 
js to aim the pistol so that a bullet from 
it would strike the underwater target. 


Subjects 


The Ss (N = 215) were fifth, sixth, sev- 
enth, and eighth graders drawn from three 
public schools in Salt Lake City, Utah, as 
follows: 

1. Stewart School (campus demonstration 
school, K-IX) Grade 8, 40 pupils; Grade 7, 
40 pupils; Grade 5-6 (combined grade), 32 
pupils. 

2. Wasatch School (middle class, K-VI) 
Grade 6, 39 pupils. 

3. Whittier School (lower middle class, 
K-VII) Grade 7, 32 pupils; Grade 6, 32 
pupils. 

'The Grade 7 and 8 pupils from the Stew- 
art Demonstration School (N = 80) served 
as pilot subjects, and 8 of them provided the 
control group given no training. Their test 
results suggested several modifications in 
the experimental conditions, namely, the 
importance of controlling for information 
leakage among Ss, and the desirability of 
increasing the difficulty of the second test 
of transfer. In addition, the pilot group 


served to show that differential amounts of 
learning were occurring among training con- 
ditions, and that, therefore, the project was 
worth pursuing. 

Of the remaining 125 Ss, one complete 
block of 32 was dropped (Whittier School, 
Grade 6) since their scores indicated that 
no learning had taken place. Although no 
definite reason for their chance-level scores 
can be given, the authors hypothesize that 
the lack of learning was due to the un- 
sophisticated level of the Ss, and the high 
anxiety level which appeared to be gener- 
ated among them by having to take part in & 
mysterious activity which involved, аторв 
other things, being called down to the prin- 
cipal's office (the rendezvous point), being 
lined up outside the nurse’s office (one О 
the testing stations), and the anxiety gen- 
crated by having measures taken of their 
learning (although in all cases Ss were ea 
sured that their scores would be known to П 
one in the school and would never form any 
basis for a report card mark). 

Of the remaining 103 Ss, 7 from Wasatch 
served to provide some of the control ma 
The data in Table 1, therefore, are basee 5 
96 Ss—16 boys and 16 girls at each of thri 
schools. = T 

'The 96 Ss undertook the experimen 8s 
their respective schools. At each schoo i 
were preselected in the sense that pupils v 
IQs either below 100 or much above i 
were not considered, and equal number. 
boys and girls were assigned to each pues 
ing condition. Ss within each school W^. 
then randomly assigned to one of the 
training conditions. 


RESULTS 


А пей 

Тһе two main measures obtal sii 

from each S were the scores for T 
fer Tests 1 and 2. These scores W 


TABLE 1 
NUMBER, SEX, AGE AND IQ or Sussects, BY SCHOOLS 
Sex РА 
T 10 
School Grade N Age range X age IQ range x 
ar E] 
6: 
j j 5 2 7 3-133 | 117- 
Jasatch 6 32 | 16] 16 | 11-3 to 12-2 | 11-7 | 103-13 31i 
Bernd 5-6 32 |16 [16| 11-0 to 12-3 | 11-7 | 101-131» ni 
Whittier 7 32 16 | 16 | 11-11 to 13-4 12-8 103-127* 
Total 96 | 48 | 48 


а Pintner Intermediate Test. 


b Wechsler Intelligence Scale for Children. 
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conver 
Pee тт to error scores by sub- 
in for each xa. alue of the point 
curves m 2 show the learning 
These = | resulted for Tests 1 and 
that no | T н give the impression 
mud =n RA between treatments 
antes a Test 1, but that differ- 
which co Ser gn Test 2, a finding 
Teported ha reg closely with that 
cred by Judd (1908). On Test 2 
ion uc ic condition and the condi- 
teria] pre e compressed visual ma- 
appear ге ed by a realistic problem 
arbitrar Perior to the other two. An 
analyses. decision was made to run 
Seven 2. the scores of the first 
on shots з only, since scores obtained 
Onger re beyond this point were no 
Shot Se ene of the group. By 
Tost reir seven, most members of 
tion, and ments had achieved crite- 
Point tend thus, shots beyond this 
e scor to be weighted heavily by 
Showed litt of one or two Ss who 
00, with tle or no learning. Obviously, 
a sufficient number of shots, 


zo- 
1754 ө— үн! 
----- е viSUAL- COMPRESSED 
a.c. REALISTIC 
1 o— —о PROBLEM 
54 
1.254 
ERRO 
Д 
Scone 1.0 4 


ттт 
5 6 оп 12 
Fi, 1 T SHOT NUMBER 
> te; А 3 
ae 1 learning curves showing 
condition separately. 
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o—— VERBAL 

Ф©-----® VISUAL-COMPRESSED 
2---:-ә REALISTIC 

о— —o PROBLEM 


ERROR 
SCORE 2.5 


s 
3 $ 6 7 8 9 юн ё 


Р R 4 
SHOT NUMBER 


Fic. 2. Test 2 learning curves showing 
each condition separately. 


ur treatments would 
eventually coincide and differences be- 
tween treatments would disappear. 
Control data, used for establishing 
a base line from whieh the amount of 
learning resulting from instruction 
could be estimated, were 0 
from a block of eight control Ss in 
the pilot group and a second block of 
seven Ss drawn from one of the ex- 
perimental schools. These Ss were 
merely given Tests 1 and 2, and apart 
from a small amount of information 
contained in the initial sentence of 
their instructions for Test 1, they re- 
ceived no instruction or information 
relating to refraction ог displacement. 
Comparisons were made between the 
control and the experimental groups 
using Ss’ mean error scores. A signifi- 
cant difference (p < 01) was ob- 
tained on à t test between the error 
scores of the control groups and the 
error scores of the experimentally 
trained groups, 48 shown in Table 2. 
d t test was made 


In addition, à secon 
comparing control and experimental 
groups; ut this time making use of 
trials to criterion. scores. The results 


curves of all fo 
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TABLE 2 

COMPARISON or CONTROL MEAN WITH THE MEAN оғ CONDITIONS 1-4 OBTAINED 
i IN Test 1 

School Control mean | esperimental ] 
(N = 15) mean (N = 64) 
Error scores, shots 1-7 Wasatch, Grade 6 iom Ре ая 
Stewart, Grade 7 
Trials to criterion Wasatch, Grade 6 АБ -— -— 


Stewart, Grade 7 


жр < 01, df = 77 (2-tailed test). 


of this test were also significant (p « 
01). 

кь 3 gives means and standard 
deviations for performance on Test 1 
and Test 2, respectively, for each of 
the training groups. An analysis of 
variance was made of Test 1 error 
scores, trials 1-7, and also of Test 2 
error scores, trials 1-7. The analysis 
of the results of Test 1 showed no sig- 
nificant differences between treat- 
ments, or between those who verbal- 
ized and those who did not, or between 
male and female Ss. 

"These results, coupled with those of 
Table 2 comparing control and experi- 
mental groups, might appear to indi- 


cate that, although teaching pupils 12- 
formation facilitates performance б 
a task requiring application of t G 
information for its solution, the man 
ner in which the information is taug 


der these conditions, on the b "d 
our particular interest in Conditio 


: infor- 
2 and 3, which vary irrelevant info 2 
mation, a ¢ test was run using dicate 


mean error scores. The results 1n 


TABLE 3 ай 
Means AND STANDARD DEVIATIONS or MEAN Error Scores ror PERFORMAN 
on Test 1 AND ТЕзт 2 


Test 1 Test 2 

Group pot 

i i G t ; up 200 

poem EE D m 
ШИШЕК, „ш 

sD 
M SD M SD M SD ы Mod. 
2.62 
1. Verbal information only | 1.60 | 1.49 | 1.24 | 1.70 | 2.32 | 2.43 | 4.34 ў 
уі Ту compressed information si 
^ а ibformation 1.36 | .80| .99 | .64 | 2.99 | 1.26 | 3.19 T 

isti onstration plus " 
А on ее на .62 | .76 | 1.54 | 1.16 | 1.45 | 1.19 | 2-58 ў 
А liti 2 

4. Problem plus Learning Condi ion dia quil con | oct bom | um Mer 

2 


Note.—N = 12 for each cell. 


> EN 3 1 —— — —/u—R 
——— SN: — 5 — . < 
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— TABLE 4 
Kon ae T iud New MULTIPLE- 
3 mt E OMPARING MEANS OF TEST 
CORES FOR CONDITIONS 
1-4 AND Ѕнотз 1-7 


Treatment 4 3 2 1 
M EN 
Condition 1.90 | 2.01 | 3.09 
iS є +. Д 3.33 
problem (4) Е 
Visual?) fa лр 
Verbal (1) P eed (2) = as 


tl OR 
d d aiming Condition 3, making use 
ratus ee-dimensional physical appa- 
ааа aed significantly more 
(р < pon did Training Condition 
applicatio 6, df = 46). Furthermore, 
е-га, lon of а Duncan's new multi- 
Scores ӨЕ test indieated that Test 2 
(propje теййшаца 3 (realistic) and 
ore ] em) reflected significantly 
Тр 08 than did scores on 
Compracnts 1 (verbal) and 2 (visually 
< .05 e , at levels varying from 
tion, ш ae Дь е. addi- 
Nee а у, де Test 2 analysis of vari- 
Main eg ibalization-Nonverbalization 
Signin с reached the .08 level of 
ion ance in favor of the verbaliza- 
Condition, 


Discussion 


1 
to соте obtained would appear 
ang Bm the findings of Judd (1908) 
(1941) tudriekaon and Schroeder 
Princip] at knowledge of a pertinent 
task re e facilitates performance in & 
ciple quring application of that prin- 
taineq furthermore, the results ob- 
Test 2 A the analysis of variance 
that wh Cores would appear to indicate 
Че €n a transfer task is sufficiently 
dent is re] е amount of transfer evi- 
he eley. ated to the manner in which 
gj The ieee principle is taught. 
беа ults indicate that опе of the 
чы ing i variations in manner of 
"eje, 5 15 the presence or absence of 
information; that is, in 
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terms of performance on Test 2, the 
realistie condition (3), with all its ir- 
relevant cues, is significantly better 
than the visually compressed condi- 
tion (2), which has had most of the 
irrelevant visual cues removed. 

| However, we must note that teach- 
ing Conditions 2 (visually compressed) 
and 4 (problem) are identical, except 
for the introduction of a problem to 
establish set as the initial phase of 
Condition 4, and yet Condition 4 re- 
sults are significantly better than 
Condition 2 results. It would appear 
obvious then that it is not the presence 
of irrelevant information alone which 
can account for the facilitation of the 
application of the principle to à new 
situation. The establishment of a set, 
prior to dealing with the visually com- 
pressed material, appears also to have 
had the same facilitating effect upon 
performance as the presence of irrele- 
vant information in the realistic con- 
dition (3). 

How can the failure of the visually 
ed treatment (2) be ex- 
plained? Probably, on occasion, line 
drawings compress information in a 
psychologically inappropriate fashion, 
or actually eliminate relevant infor- 
mation. 

However, even if this were true of 
the line drawings used here, we are 
still left with the task of explaining 
why the mere act of preceding Con- 
dition 2 with a rather simple problem 
for which, and from which, Ss re- 
ceived no feedback should have had 
such a facilitating effect. If we assume 
that some set was established and that 
somehow some process of selective at- 
tention was aroused, we must still ask 


uch questions 8$: 
ji id Dom the establishment of a set 


somehow permit S to reshape inap- 
propriately compressed information 
into an appropriately compressed 


form? 


compress 
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2. Does the establishment of a set 
inevitably provide additional relevant 
i ion? 

E a clue to the solution of the 
problem lies in the second of these 
questions. Berlyne (1960, p. 63) notes 
that one of the factors affecting proc- 
esses of selective attention is a class 
of stimulus events which he labels 
indicating stimuli, a class which in- 
cludes verbal directions. It is logically 
conceivable that the establishment of 
set serves to arouse processes of selec- 
tive attention similar to those aroused 
by Berlyne's indicating stimuli. If this 
were so, possession of an appropriate 
set might serve to permit S to select 
out from the stimulus complex those 
stimulus elements which contained the 
most information. These highly loaded 
pieces of information to which he has 
now directed his attention may, in 
turn, serve to permit S to process the 
remaining information in a manner 
different from another S who, lacking 
set, perceives the stimulus complex in 

a relatively unselective fashion. In 

other words, we are really suggesting 

that provision of a set may be compar- 
able to the provision of additional in- 

formation. Reese (1964) has discussed 
the phenomenon of performance set 
which would also appear to be closely 
related to the point of view expressed 
here. 

An explanation of the facilitating 
effect of set, such as the above, how- 
ever, seems to do little to explain the 
apparent facilitating effect of irrele- 
vant information, which was evident 
in the realistic treatment (3). One 
clue did emerge from the written an- 
swers provided by the verbalization 
task. Several of the written answers 
of pupils exposed to the conditions 
which used line drawings stated that 
the coin appeared to move “to the 
right side, and slightly closer to the 
surface of the water.” Ss who provided 
this type of answer appear to have 
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failed to differentiate between those 
drawings which showed an end eleva- 
tion and those which showed a side 
elevation. This confusion was not evi- 
dent with Ss in the realistic treatment. 
In addition, inasmuch as the criterion 
tests involved a form of depth percep- 
tion and inasmuch as depth percep- 
tion is dependent on three dimensions, 
or the illusion of three dimensions, it 
is possible to hypothesize that а 
visually eompressed treatment, ve 
ing a third dimension, or even the 1l- 
lusion of a third dimension, actually 
was deficient in relevant information 
as compared with the realistic treat- 
ment. А 

The third variation in training con- 
ditions investigated in this experimen” 
was provided by the Verbalizabiorr 
Nonverbalization conditions, bow" 
one-half of the Ss of each icone 
were required to give written pe ed 
to two questions relating to the 1010 © 
mation to which they had just Kon 
exposed. No significant differences Бе” 
tween the Verbalization-Nonverba i 
zation groups appeared in Test 1. 
Test 2, a difference favoring the que 
who verbalized did appear, but Te 
significant only at the .08 level. TP 
however, would suggest the possibillty 
that verbalization prior to a Ы тя 
task of sufficient difficulty may fac! 
tate performance. fer 

Finally, we must ask why ng 
was very much the same for all in 
experimental training conditions 1 
the initial transfer test. Judd, himse 
noted a related phenomenon in dit 
original experiment and attribute n- 
to the fact that his Ss were still m a 
ing to throw the darts. As very Н as 
learning of an “aiming” nature bn 
required in this present experimen” 
his explanation would seem not to а 
ply. Hendrikson and Schroeder COW 
cluded from their experiment that Р ae 
Session of knowledge did facili 
transfer in the initial task. HoweVe" 


э Q————. 
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тран о of the results on which 
By on ase this conclusion shows that, 
Sica ORABAT standards, their differ- 
the o. etween the control group and 
Fei 9, experimental groups, ог be- 
| Mes two experimental groups 
ninot ' were not statistically sig- 
бор ded the discussion, to this 
Varios have observed that certain 
pear t E in training conditions ap- 
effect ol have a notable facilitating 
Cult sec performance of a more difi- 
and if saag task. If we are correct, 
vidi e find no facilitating effect of 
опа me variations in training con- 
cult "ee odia a similar but less diffi- 
Bd ine , We would appear to be justi- 
all ex oneluding the following: Ss in 
enough peg us treatments learned 
йөр T solve the relatively easy 
on it ; ask 1; that is, for its solu- 
ing ne relatively little learn- 
ing this minimal amount of learn- 
УР provided for by all four 
consi -— In contrast, Test 2, being 
а алыи more difficult, required 
Which h. "i learning for its solution 
А ities been acquired only by those 
and 4 sed to Conditions 3 (realistic) 
(problem), 
bilition vom at least two other possi- 
Mrd be considered. The first 
ere at a that the scores on Test 1 
ents The chance level for all treat- 
Visual 4 hat is, having fired at the 
Was а i get and being told that it 
Proce бек as many Ss would then 
as would o lower their point of aim 
elow 44, Proceed to raise it. As aiming 
Опе poi € apparent target as little as 
Woul, ce of the scale on the grid 
reg eie. in a hit, those who low- 
т 2 aim would soon be on their 
arison иа s-eye. Although the com- 
men 8085 control scores with treat- 
Seem to res shown in Table 2 would 
discount this possibility of 


ti 


chance-level performance, we must 
recognize that control data for this 
experiment is somewhat deficient both 
in N and in comparability. 

The second possibility is related 
both to the difficulty of Test 2 and to 
its position. While performances in 
Test 1 were essentially the same for 
all experimental groups, only Ss of 
Conditions 3 (realistic) and 4 (prob- 
lem) were able to benefit substantially 
from the learning experience which re- 
sulted from participation in Test 1. 
That is, we might hypothesize that, 
because of their exposure to à particu- 
lar type of instruction, Ss were able 
to process the feedback which they 
received while participating in Test 1 
differently from those Ss who had re- 
ceived a different type of instruction. 
Thus, while all Ss, in effect, went into 
Test 1 with equal ability, those who 
had earlier received instruction in 
Conditions 3 and 4 emerged from 
Test 1 with superior ability. 

In the light of the results reported 
and discussed above, it would appear 
that we may conclude tentatively that 
the most efficient conditions for trans- 
fer exist where irrelevant cues present 
in the testing conditions are also pres- 
ent in the training conditions. To gen- 
eralize further, we might hypothesize 
that the most effective learning, as re- 
gards transfer, occurs in the presence 
of irrelevant cues. To test this hy- 
pothesis we must verify whether the 
negative effect of the visually com- 
pressed treatment was due to the par- 
ticular line drawings used, or whether 
it was due to the absence of irrelevant 
cues per se. Second, we must ask our- 
selves what the relationship is between 
the presence Or absence of irrelevant 
cues in the training situation and the 
presence Or absence of irrelevant cues 
in the test situation. To answer these 
questions an experiment was designed 
which will be reported on separately. 
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AUD 
IENCE RECALL AS A FUNCTION OF SPEAKER 
DYNAMISM 


M D. COATS 4x» ULDIS SMIDCHENS 
chool of Education, University of Michigan 


Various relationships among speaker dynamism 


, grade threat, indi- 


vid Í 5 
ual speaker, and immediate recall with respect to a sample of 184 


college speech students were 
exp) 
toe oe of a 3-factor, fixed-effects analysis of v: 
eee ee of association. The only relation whi 
d а strong was between speaker dynam! 
e speech by the audience. The relation produce 


lored. The data were analyzed by 


ariance and the «w° test 
D ch was both signifi- 
ism and immediate recall 
d а significant F 
ation (ш?) of 36. 


ratio at the .001 level i 
It d evel and had a strength of associ 
was concluded that audiences remember more from a dynamic 


lecture than from a static lecture. 


Ir А 
dg ee of whether or not the 
or M is the best technique 
ensive] ing, it is in fact used ex- 
for tudo S many classrooms and 
DVestigati S of all ages. Hence, new 
vhic re me delivery variations, 
ig LU Eo? influence lecturer effec- 
Particula are always needed. The 
Which the „980806 of lecturing in 
interested authors of this paper were 
tion, Atta ы that of auditor atten- 
Process of oo was defined as the 
ear upo ringing the sense organs to 
CE egere stimuli out 
“eptual feld available in one's per- 
study Theoretical development of the 
three уар built upon the following 
Selective 0 tions: (a) Attention is 
attentio (an individual pays more 
Others) m to some stimuli than to 
(the dun (b) attention is fluctuating 
single ation of absolute attention to 
Onds) « stimulus is only a few sec- 
rectly’ (c) in order to remember di- 
lecturer Y statements made by & 
of the pe must attend to some part 
believe lecture. The experimenters 
JUStifieg these assumptions were 
1Onsider by intuitive and logical 
Bical ations as well as by physio- 
1 al aga ү psychological research 
935; ру 1914; Chapman & Brown, 
Of ы Sbury, 1908, Chs. 1-4). 
many concepts which relate 


to the problem of attention, the 
authors felt that dynamism (change 
or variety, animation, and power in 
the speaker) is the most important. 
An examination of the three above 
assumptions indicated that a dy- 
namic lecture delivery should have 
more success than a static one in 
eliciting and maintaining the at- 
tention of a class. The specific hy- 
pothesis follows: 
1. Students will remember more 
from а given lecture when the 
ic than when it is 
he hypothesis of 
d which most of 
the design of the experiment evolved. 
Furthermore, it was believed that 
artificial motivation due to grade 
threat might positively affect class 
h such à considera- 


attention. Althoug 1 
tion was somewhat divergent from 


the main thrust of this paper, relevant 
data were obtainable with little 
change in 


the experimental design. 
The second hypoth 


esis follows: 

2, Students who are told that 
they will be tested on the content of 
a lecture will remember more from 
that lecture than students who do not 
receive suc 


h a grade threat. 
METHOD 
Design 


The stati 
factor, fixed-e 


stical design consisted of a three- 
ffects analysis of variance. One 
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variable was type of lecture, an- 
cese eid grade threat, and the third was 
the individual doing the lecturing. The de- 
pendent variable was the immediate recall 
by the class of the lecture. Also, the design 
called for the calculation of the various a 
(Hays, 1964) values in order to ascertain 
the proportion of variance accounted for by 
the respective treatment effects. A planned- 
comparison technique was used to evaluate 
audience perceptions of speaker dynamism 
under the various treatment combinations, 


Procedure 


The subjects (Ss) consisted of 184 stu- 
dents in eight introductory Speech classes 
at the University of Michigan. Two Speak- 
ers, graduate students in education at the 
University, each lectured to four different 
re about classroom- 
Were expository in na- 
tent, and lasted ap- 


static fashion each was c 
ated prior to the data coll 
professor at the Univers 


The other manipulated variable dealt 
with whether or not the class was directed 
to pay close attention to the lecture because 
of a grade threat. Any specific speaker gaye 
a dynamic presentation to a class which had 
received a grade threat and to a class which 
had not received such a threat. The same 
speaker also gave a static delivery to a 
grade-threatened class and to a class which 
was not grade threatened, 

The class instructors introduced the 
Speakers by saying: "This gentleman is go- 
ing to talk to you for a few minutes.” In 
classes where the grade threat took place the 


instructors added: “Pay attention to the 
lecture because I will expect you to know 
what he has talked about after he has 
finished.” | " 
In addition to the dynamism scale eac 
class was given a 10-question шше лога 
test immediately after the lecture. This tes! 
measured immediate recall of what had perd 
said in the lecture and had a E der Bankon 
son reliability of .64 (Guilford, 1956). A of 
the recall test, was administered toa ср 
group to see if Ss had much previous know! 
edge of interaction analysis. © 
In order to decrease the effects of a ied 
sible lack of equivalence among the Ee 
variances of the groups on the results of B 
analysis of variance, the group sizes 079 
made equivalent, (20 per group) by п: 
ing or increasing the group size as шесевгагу 
after the data were collected. A Eon 
Was decreased through the use of ran dn. 
digit tables (Fisher & Yates, 1963) E : 
creased by considering the mean of a 
group as an observation the necessary ae 
er of times, Six of the groups were А 
creased in size, one group size was ag 
by one, and one group size was increase a 
Seven. The equivalence of the group, КАП, 
also facilitated the making of meaning 


ch 
planned comparisons (Hays, 1964), Т 
provided a more powerful statistical ana. 
than conventional ¢ tests, 


RESULTS 


The planned comparison used = 
test the difference between the ee 
namic and nondynamic presentation 
Produced an F ratio of 708 km 
Was significant at the .001 ien 
with df = 1159 Clearly, the aud “a 
ences’ perceptions of the dy pst 
and nondynamie presentations Ye 
quite different and in the expected 
rection, d 

Table 1 presents mean scores aa 
standard deviations on the imme " 
ate-recall test for each of the group". 

n analysis of variance for the 2e 
mediate-reca]] Scores produced F Кы 
tios which were significant for all > 
the main effects and interaction 52 
fects at the .01 level with df = 1,1 i 

Owever, the only independent var 
able which was strongly associata 
with the dependent variable was t 


EFFECT or SPEAKER DYNAMISM 


И ТАВІЕ 1 
ANS 4 TT. 
aa SrANDARD DEVIATIONS OF THE 
MMEDIATE-RECALL TEST 


Group M Sp 
Dynamic 
Grade threat 
ecturer 1 à 
, ecturer 2 88 E 
“NO grade threat | pa 
:ecturer 1 
Lecturer 2 88 125 
ondynamie | i 
rade threat 
ecturer 1 5 Т 
ecturer 2 7.2 138 
о ide threat К i 
cturer 1 7 
“ecturer 2 73 гн 


abou 
коз, 'speaker dynamism" ac- 
no other d 36% of the variance, while 
кш 10% ariable accounted for more 

" j 
est ine on the immediate-recall 
of 3, 1e control group had a mean 
l54 A b a standard deviation of 
Scores ot sc arison of the mean of the 
the: ares ы the control group with 
received of the scores of Ss who 
Dent aid. nondynamie lecture treat- 
computin grade threat was made 
ing a Student ¢ ratio using 


ol 5 
аап шше of the population 
- The £ ratio was found to 


be 1 
438 wi 
001 pr df — 58 (significant at 


Discussion 


Beca 
оце ol of a faulty procedure used 
ass, results related to the 
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second hypothesis (effect of grade 
threat) were inconclusive. Also, due 
to the restricted nature of the sample, 
the generalizability of results is 
somewhat limited. However, the data 
very convincingly supported the ma- 
jor hypothesis. Students did remem- 
ber much more from the dynamic lec- 
ture than from the static one. Such 
a finding has strong implications 
for both practice and future research. 
The work and planning, which are 
requisite to effective dynamic speak- 
ing, are evidently rewarded with a 
high degree of audience attention. 
The relation which was demonstrated 
to exist between speaker dynamism 
and audience attention might be 
considered by teachers as well as 
by paper readers at conventions of 
professional organizations. 
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PERCEPTUAL SPAN TRAINING AND READING 


ACHIEVEMENT OF 


BRUCE R. AMBLE! 
Des Moines Child Guidance Center, Iowa 


To determine if a reading program 


AND 


SCHOOL CHILDREN 


SIEGMAR MUEHL 2 
College of Education, State University 
of Iowa 


of over 5,000 phrases would help stu- 


improve their reading skills and be a useful supplement to regular 
е € С нме 3 experimental studies were conducted. The Hee 
training program was designed to increase perceptual span and help в a 
dents develop habits of integrative phrase reading. Ss were intermedia‘ 
grade-school children and remedial readers. Comparisons between ex- 


perimental and control 


tained in follow-up posttests. 


Several investigators (Buswell, 
1939; Gilbert, 1953; Taylor, 1958) 
have provided evidence for increased 
efficiency of visual and perceptual 
skills associated with reading during 
the school years, However, as Taylor 
has noted, the development of those 
skills—as measured by various eye 
Movements, span of perception, and 
rate of comprehension—has been left 
largely to trial-and-error processes in 
the school program. Although various 
controlled reading devices and instru- 


ment techniques have been devised to 
train these skills, 


opinions regarding 
their usefulness have been conflieting 
(Spache, 1958; Taylor, 1958; Tinker, 
1946). Research results in this area are 
also dissonant. Of particular interest 
in the present study were the research 
findings related to tachistoscopic 
training designed to improve reading 
rate and/or comprehension skills by 
increasing the accuracy, speed, and 
span of recognition. 
Investigations conducted by Mano- 
lakes (1952) and Sutherland (1945) 
with college and adult readers, and 


*Now Assistant Professor, Department 
of Educational Psychology and Guidance, 
College of Education, Southern Illinois Uni- 
versity. 


groups significantly supported the use of the 
phrase reading program. Both readin 


improved, depending on the criterion t 
were consistent for low, medium, and ви 


£ comprehension and reading rate 
est employed. Reading gains 
perior readers and were main- 


Marvel (1959) with high-schoo: 
sophomores found  tachistos¢ 2 
training procedures to improve e oen 
ing skills were not as effective as © ro- 
types of reading improvement fum 
grams. The studies by ag 
(1948) with college freshmen, ixth- 
muth and Aker (1961) with P with 
grade students, and Goins (1058) d 
first-grade elementary school ons er- 
Showed no differences in rondine PE 
formance for subjects (Ss) wit are 
chistoscopie training when сор per 
With control Ss who received ir ain 
different or no special reading a rent 
ing. Other studies with widely di Мав" 
populations (Dumler, 1958; not 
Latchy, 1946; Renshaw, € sige 
using control procedures саше» е 
nificant reading gains for 1801939) 
Scopically trained Ss. Weber Cow 
found tachistoscopically trained а 
reading college freshmen Кр with 
matched group of Ss traine Ете" 
Pressey's Manual of Reading eases 
cises for Freshmen showed (ee соп“ 
in reading rate when compared d ele- 
trol Ss on a follow-up test. At © а Van 
mentary school level, Jones an read" 
Why (1963) reported favorable Topic 
ing rate gains after taehisios or- 
training for fourth-grade Ss ao for 
able reading comprehension #3 
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fifth- 
UN nes 
a a pti oot ahs fed mee по 
арн Studies attempted to assess 
land (1045) ren of recognition. Suther- 
5 made ) reported her experimental 
span M erii gains in reading 
the cont T4 no tests were made of 
йб wi a Ss. Freeburn (1948) found 
mental a. differences between experi- 
In eed control Ss after training. 
ing to "ea perceptual span train- 
een use “pe reading efficiency has 
highly primarily on older Ss with 
very es reading habits or on 
Sults of wd beginning readers. The re- 
оса], M Á studies have been equiv- 
evidenced шоны) problems were 
cedures Thi inadequate control pro- 
Practice e amount of tachistoscopic 
Materials with experimental reading 
n additio was not impressively large. 
gures um nonsense forms, geometric 
rainin digits were often employed 
Published 4 materials. A review of the 
9r furth research suggested the need 
creased xe investigation using in- 
Ore ap mounts of training materials 
With е, amp to the reading task 
Brade.. SS experienced, intermediate 
ч аснод children. 
Cuties as mental studies were 
tachistos to assess the usefulness of 
ading copieally presented phrase- 
mathet o riale, In Experiment 
puja UPS of fith- ad sixth- 
distributi ents each had five hours of 
W Get training. Experiment 
ings EE to cross-validate the 
investi xperiment I. Experiment 
leading ey the use of phrase- 
Brade- an d aining with intermediate 
Temedi Junior high-school students 
lal reading programs. 


The fi EXPERIMENT P 
Provide Ee investigation attempted to 
Mswers to four basic ques- 


2 
uss 
be 
riment I was based on a doctoral 


tions: Does the span of recognition im- 
prove with tachistoscopic phrase-read- 
ing training? Does reading rate and/or 
reading comprehension improve with 
tachistoscopic training? If there is im- 
provement, do low, average, and su- 
perior readers improve in the same 
areas at the same rate? What is the 


relationship between span of recog- 
nition and reading achievement? 


Materials 

Motion picture films were developed to 
provide tachistoscopic-type practice phrases 
and to test phrase-reading skill. The films 
were projected at sound speed with 16-milli- 
meter projector equipment. Each phrase ар- 
peared on the screen for V$ second, enough 
time for a single reading fixation (Tinker, 
1929). The letters in the phrases appeared on 
the screen white against & contrasting grey 
background, designed for visual comfort 
under sustained practice conditions. Two 
sets of phrase-reading films were designed 
to provide systematic exposure of many 
phrases. 

The first series, designated the 5200 
Phrase Reading Film Series’, consisted of 
10 separate training films, with each film 
containing 260 different phrases. The films 
twice during training, which pro- 


one to five words wl 

of longer phr 

There was & 
esentations 

o pare for the next 


d on the screen 37 


te at this rate. Г 
The second series of hrase films, desig- 


ated the 420 Phrase Film Series, consiste! 


ur 
The author 


dissertation (Amble, 1963). 10 
gratefully acknowledges the assistance of 
Siegmar Muehl and James B. Stroud, Co- 
chairman. The cooperation of Arnold Hjelle 
and the Sigourney, Towa, Public Schools 
h appreciated. Experiments П and 
nducted in the Des Moines, Tora 
i hools with the support and co- 
pue f John Harris, Robert Denny, 
. Peterson. 
x e Audiovisual Center, 
Division © University Serv- 
ices, State University of Iowa, Iowa City, 


Towa. 
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ial ith each film contain- 
i s ied e phrases мае 
mg lfth phrase in the 
selected as Lites goes twice, pro- 
first beu of 420 practice phrases. There 
een жы intervals between presenta- 
ficus which gave the students time to read 
each phrase and write it on paper d 
for this purpose. The projector was stoppe 
after each phrase appeared on the screen 
employing the technique suggested by Gil- 
bert (1959a, 1959b). Phrases appeared on the 
screen three times per minute at this rate. 

The phrases used in both film series were 
selected from third-grade reading materials. 
They consisted of meaningful units when 
more than one word in length. Several ex- 
amples are given: shop, ribbon, department; 
just one, too much, winter came; the long 
night, an old boat, the next morning; say it 

out loud, the day was warm, under the ap- 
ple tree; they looked at each other, a 
streak of red paint, the red and yellow 
leaves. 

Following each practice film 
series, a similarly designed daily 
also on film, was administered t 
immediate effects of training, 
lish and maintain motivation 
each contained 15 phrases, 
words in length, During 
test, the projector was st 
phrase appeared on the s 
given time to write out th 

Three alternate-form с 
(designated A, В, and C) were developed 
to assess changes in phrase-reading perform- 
ance (span of Tecognition) before and after 
training. Each test contained seven single- 
word warm-up phrases and twenty-five test 
phrases. The test phrases ranged in length 
from one to five words. The criterion phrase 
tests were presented in the same manner as 
the daily phrase tests. The Warm-up phrases 
were provided to standardize test conditions, 
and during the pretest were to Screen Ss not 
able to read single-word stimuli from the 

sereen. Mimeographed answer shects were 
provided Ss to write out their test re- 
sponses. : 

The Iowa Silent Reading Test (ISRT), 
Elementary Edition, Forms AM, BM, and 
CM were administered before and after 
training, to assess changes in reading rate 
and reading comprehension. Subtest 1, Rate 
and Comprehension, and Subtest 4, Para- 
graph Comprehension, were used for this 
purpose. The Iowa Test Basic Skills (ITBS) 
is an annually administered school test. Vo- 
cabulary and Reading Comprehension test 


in both 
phrase test, 
O assess the 
and to estab- 
- These tests 
from one to five 
the daily phrase 
оррей after each 
creen and Ss were 
cir responses, 

riterion phrase tests 
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i year 
scores were obtained from it Ms nem 
preceding and the year following the 
training program. 


Procedures 


- and 
Subjecis. A total of 145 Ss, 76 AA the 
69 sixth-grade children, ра араар inge 
investigation. The fifth graders esi months, 
from 9 years 9 months to 11 years ths, For 
with a mean age of 10 years 3 on years 
the sixth-grade class, the range was with а 
11 months to 12 years 10 Ee oy te final 
mean age of 11 years 6 months. Of the 2 
analysis, 120 Ss were included. геепїп& 
eliminated, 8 failed to pass the y match- 
test, 10 were lost in the preliminary ab 
ing and 7 were absent during the p. to com- 
Design. The study was designe roaches 
pare the effectiveness of two SL reading 
of phrase training on subsequen sed the 
achievement. One training method u ed to 
5200 Phrase Film Series, The Ss [por 
this procedure received a phrase ses. The 
program of 5,200 continuous ap Phrase 
Second training group used the 42 hrase- 
Film Series. These Ss received hich were 
training program of 420 phrases wh ing. 
written out during the course of train nd two 
The children in two fifth-grade = Тома, 
sixth-grade classes in the Sigeume gust at 
Public Schools were pretested in peni a 
the beginning of the school AM an 
reading achievement test, ISRT- de were 
Phrase Test A. The Ss in each gra ent 88 
ranked in order of reading achieven sub- 
determined by an average of the IS stitute 
tests. Three reading levels were Бат а 
by counting off 20 Ss at a time, boei grade. 
the top of the distribution in enpa т were 
The levels constituted by this metho ow 
designated as the high, medium, ie were 
reading groups. Ss within each 500-phrase 
assigned alternatively to the mek the 
group and to the 420-phrase group. groups 
esign consisted of two reaming | repli- 
matched at three reading levels, wi 
cations at the fifth and sixth grades. a 
The students in each classroom А 
phrase training at the same time ea distri- 
in their respective programs. The oa An 
bution of Ss both by training A == 
reading levels within the four cla been 
Was a chance factor since the Ss ey jnde- 
assigned to groups and reading leve o 
pendent of class membership. One nine 
authors and an associate served as training 
instructors for these periods. The betwee? 
groups in each class were alternated ch in^ 
the instructors so that over all, ез 


ceived 
n day 
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structor trai 
group. А about half the Ss in each 
that half d on of training was altered so 
were e Be ap in the 420-phrase group 
the auditori in the classroom and half in 
employed Wk a comparable procedure was 
projectors xd. i 5,200 phrase group. Film 
he trainin, screen were alternated daily. 
through 10 g period for all Ss continued 
,. Immedi ae ee school days. 
ing nities D mos the phrase-train- 
Were admi he ISRT-BM and Phrase Test 
est C w nistered. ISRT-CM and Phrase 
three morte administered in December, 
pleted, hs after training had been com- 
The i ы 
ements Loma for both film series were 
о the phra he same. The Ss were introduced 
reading p se-training program as à praetice 
improve pes тер and were encouraged to 
Was BUS the daily phrase tests. Each film 
Were ene s tic daily in sequence, and Ss 
time e ag to do better the second 
raining and. it. As Ss became familiar with 
essary to cl Tost procedures, it was not nec- 
structions, The dai reading the detailed in- 
Sions were he daily training and test ses 
In both ex about 25 minutes in length for Ss 
aw шеи conditions. 
Were posted 1 from the daily phrase tests 
t large po before the homeroom perio 
lon, If a MEA charts to maintain motiva- 
hys seore udent improved on his previous 
ms Score, e, a gold star was fixed next to 
"ойго е 
aDsent, gl Ss included in the study were 
11е mean spa testing period. In these cases 
чавд Qo iHe for their training group WaS 
Gre On пра the missing individual's 
i = 72) = ITBS fewer matched S pairs 
mplete his used in the analysis since 
ata were not available for all Ss. 


A fo RzsuLTS 

design oe aem analysis-of-variance 
quise паз used with the data (Lind- 
Common The two between factors 
Teading i ү all analyses consisted of 
and low) ility levels (high, medium, 
А eo grades (fifth and 

all anal e within factors common to 
рз 29 consisted of matched 
Ds) an тума cm and 420-phrase 
= p and test periods (pretest, post- 
темор postposttest). Each of the 
rately ‚ Measures was analyzed sep- 
Phrase Сны daily and criterion 
ests; ISRT—Rate, Compre 
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hension, and Paragraph Comprehen- 
sion; and ITBS—Vocabulary and 
Reading Comprehension. Phrase test 
scores used in the analysis were raw 
scores. Standard scores were used on 
the ISRT and grade-equivalent scores 
were used on the ITBS. 

To make for economy in reporting 
the results of the various analyses, the 
main effects associated with reading 
level and test periods can be sum- 
marized immediately, since these ef- 
fects were similar in all analyses. 
Reading achievement levels and read- 
ing performance increases over test 
periods were significant in the ex- 
pected direction (p € .001). 

The main effect of grade placement 
was significant in the expected direc- 
tion in two analyses: ISRT Compre- 
hension (p € .01) and ITBS Reading 


Comprehension (p < .025). The inter- 
reactions of test periods with reading 


levels or grade placemen 
i hese scores are col- 


reported since t 
lapsed across training groups, and for 
this reason are not of interest. 


howed & significant 
triple associated with 
training grou k 

05). The interaction indi- 


riods (p € · i 
cated that the sixth-grade 


5,200-phrase group made m 
uous gains during the training program 


than did the other Ss. 


топ Phrase Tests 
ed a significant 


Criter 
-aini o reading levels, 
training € (p € 105). The character 
si „action 18 shown in Table 1. 

The Ё tests indicated that the medium 
level, 5,200-phrase group, read 

y more of the phrases im- 
ly following training than did 


mediate 
level, 420-phrase group. 


the medium 
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TABLE 1 
MANCE OF 5,200- AND 420-PHRASE 
EN 2 ON CRITERION PHRASE Tests 
(Experiment I) 


Dec. 
айий Reading | Aug. | Sept. st- 
Tenning N | “Tevel | pretest |posttest Кет 

32.235 | 37.70 | 40.50 

Smo pham: тан 42.90 | 51:53 | 51.30 
20| High |4803 | 54.98 | 01.30 

20 | Low 7.78 | 36.88 | 39.35 

Copies. [an от 39:88 | 4110 | 47.58 
20 | High | 52:75 | 57.70 | 50:55 


^ Maximum score: 75 words, 


However, they did not maintain the 


advantage during the follow-up post- 
testing. 


me testing period 
Tanged from .39 to -62, for all Ss, 


ISRT Rate 


There Were no reliable differences 
associated with trainin, 


ISRT Paragraph Comprehension 
The analysis showed a significant 
interaction associated with the train- 


TABLE 2 
PERFORMANCE oF 5,200- AND 420-Prrasp 
Gnours on ISRT PARAGRAPH 
COMPREHENSION SUBTEST 
(EXPERIMENT I) 


ining groups Aug. Sept. De 
Training group: pretest | posttest Potten 
канд, 

5,200 phrase 

M 132.32 | 143.65 146.22 

GE 4.78 5.93 6.26 
420 phrase 

M 134.42 | 140.25 140.10 

GE 4.94 5.53 5.51 


Note.—N — 60 per Training group. 


ing groups and test periods (p M 01). 
The character of this interaction is 
shown in Table 2. Examination of the 
data indicated that the 5,200-phrase 
trained Ss made increasing gains a 
comprehension scores over the posttes 
periods. The £ tests showed the pap 
ences between training groups to be 
reliable on the three-month follow-up 
posttest (p « .05). 


ITBS Vocabulary 


There were no reliable difterenest 
associated with training groups the 
significant interactions among t 
variables. 


ITBS Reading 


а сн 
The analysis showed а significar? 
main effect for training groups (P 


TABLE 3 "m 
PERFORMANCE oF 5,200- AND Фао ита 
Grovrs ow ITBS Rwapina SUP 
(Experiment I) 


= -63 
Training groups | Midyear 1961-62 | Midyear 1962 
5,200 phrase 

M 4.82 6.09 
420 phrase 

M 4.78 5.56 


Note.—N = 36 per Training group- 


025) and a significant Group X a 
Period interaction (p « .025). in 
character of these effects is som 
Table 3. Although Ss in both Lies 
ing groups made significant зарою в 
ment from one year to the next, afe 
Taie of gain for the 5,200-phr^^ 
groups was greater, The Ё tests 
ween training-group means agate 
reliable difference at mid-year 196: 

(p < 01). 


ExrrRmMeNT II a 

Experiment II was designed P 

cross-validate the findings of dm 
ment I, to determine the effective 


| 
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of 
make reading program, 
the 1-а, the feasibility of adapting 
еи аы с Als for use by class- 

as part of 

Program of volo had d ase 
e IL II differed from Experi- 
Журе of ш Several respects. A different 
vide a apt group was used to pro- 
studies д of comparison. The two 
үзе. буш with respect to the 
trainin administering the phrase 
slight] " programs. They differed 
мун ee respect to the training 
йе in g Periment II was con- 
E d he spring of the year when 
the ns ents had nearly completed 
egular reading program, while 


Xper; 
the far I had been conducted in 


Materials 


The 
describe 5200 Phrase Reading Film Series 
the first eua Experiment I was modified after 
Performance y. An analysis of phrase-reading 
that the aee the initial study indicated 
Scriminatir, and five-word phrases were not 
Schoo} childne among intermediate grade- 
difti lt to ren, being either too easy or too 
o iminateq реа. ‘These phrase lengths were 
nly two. th revising the film series, leaving 
таў V ё тее-, and four-word phrases as 
three tests training program. The daily 
mace an oe shortened to seven phrases, 
щу e Our words in length, and used 
ays tho Purposes of motivation. In other 
the asic features of the series re- 

B e ае 
у ад UR phrase tests (designated A, 
Th eliminating + made more discriminating 
phe revise 95 опе- and two-word phrases. 
TH 5, three. tests included twenty-one 
vi i Ve-word four, and five words in length. 
we le dent Phrases were retained to pro- 
ei ade in top for the tests. No changes 
tig ure used the seven-word warm-up pro- 
ns, to standardize pretesting condi- 


Th 

Eje Io А 
Cip entary pent Reading Test (ISRT), 
fog, еге а dition, Forms AM, BM, and 
2, Di Subteste T used as criterion reading 
4, N 3 oes and Comprehension ; 
iste, аавтарћ ading; 3, Word Meaning; and 

е Comprehension were admin- 


апа ассогаі 
Procedu aë to the standard instructions 


197 


Procedures 


Subjects. A total of 436 fifth-grade chil- 
dren participated in the investigation. Ss 
ranged in age from 10 years 3 months to 
11 years 9 months, with a mean age of 10 
years 10 months. The Ss attended five 
public elementary schools from various social 
economic areas in Des Moines, Iowa. 

Design. The study was designed to assess 
the effects of two levels of phrase reading 
training as compared to & contro] condition. 
Two training groups and the control group 
were constituted as follows. Each of the five 
schools contained three fifth-grade classes. 
The classes in each school were randomly 
assigned to one of three groups. Thus, the 
two training groups and the control were 
each made up of five different classes (and 
teachers) representing five different ele- 
mentary schools. The training groups were 
designated according to the amount of 
phrase reading practice they received: 5,200- 
phrase group and 2,600-phrase group. The 
control no-phrase group received no phrase- 


reading training, and the Ss remained in the 
The Ss were distrib- 


usual school program. 
uted among the groups as follows: 5,200- 
144; 2,600-phrase group, 140; 


phrase group, 
and the no-phrase group, 152. 
To control for differences in ability level, 
Ss within each of the training groups were 
assigned to one of three subgroups based on 
their prephrase test scores Score ranges 
were selected that would keep at & minimum 
the number of Ss dropped to obtain the 
proportionality necessary for the statistical 
analysis. Forty Ss were included in each of 
the three low-ability subgroups, with scores 
ranging from 11 to 33 points. Fifty-three Ss 
were included in each of the three average- 
ability subgroups, with scores ranging from 
34 to 53 points. Thirty-two Ss were assigne 
to each of the three high- bility subgroups, 
with scores of 54 to 83 points. The training 
and control groups each contained 125 Ss 
with 375 in all groups. 


Sixty-one Ss were €. ted in obtaining 


limina! 
in the subgroups. 
4 Ability grouping based on prephrase test 
performance was used in all comparisons for 
two reasons. First, this method allowed for 
comparison of similar groups; second, cor- 
relation ients computed between 
phrase test scores and the various ISRT sub- 
test scores, using all Ss, indicated that ability 
the phrase test were related 


differences on 
to performance on these subtests (7's ranged 


from .43 to .69). 
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The majority of Ss eliminated had missed 
missed one or more reading tests; and within 
this group, Ss were eliminated on a chance 
basis. Of the 375 Ss finally included in the 
study, 32 had been absent for one testing 
period. In these cases, the mean score for 
their training group on a particular test was 
used to replace the missing individual score. 
Classroom teachers administered all as- 
pects of the testing and phrase-training pro- 
gram in this investigation. The pretests were 
administered on March 17 and 18, 1964, in- 
cluding the ISRT-AM and Phrase Test A. 
The 5,200-phrase group began training on 
March 19 for ten consecutive school days, 
Ss im this group saw each of the ten films 
in sequence, with each film shown twice on 
the day of practice. The 2,600-phrase group 
continued in the regular classroom program 
for five days and had five days of phrase 
training beginning on March 26. They saw 


Films 1, 3, 5, twice during 
this time. The teachers in both 


Rzsuvrs 
A three-factor 
design was used 


(5,200-phrase, 2,600-phrase, and no- 
phrase groups) and ability levels (low 
average, and high) as constituted on 
the prephrase reading test, The within- 
factor common to all analyses con- 
sisted of test periods (pretest, posttest, 
and postposttest). Separate analyses 
were made on each criterion measure, 
including the criterion phrase tests and 
the standard subtest scores from the 
ISRT: Rate, Comprehension, Dj. 
rected Reading, Paragraph Compre- 
hension, and Word Meaning. To ob- 
tain additional information on changes 
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in reading rate, analyses were also 
made on the total number of items at- 
tempted on the ISRT Subtests: Com- 
prehension, Directed Reading, Para- 
graph Comprehension, and Word 
Meaning. я 

То make for economy in reporting 
the results of the various analyses, the 
main effects associated with ability 
groups and test periods can be sum- 
marized immediately, since these ef- 
fects were similar in all analyses. 
Ability level was significant in the ex- 
pected direction (p < .001) as or 
performance over test periods (p 
.001). Test period and ability group 
interactions will not be reported BD 
the scores in these interactions are F p 
lapsed across training groups, and, 1o 
this reason, are not of interest. 


Criterion Phrase Tests 


The analysis of variance showed 
significant effects associated the 
training groups (p < .001), and ter- 
Test Period x Training Group 1 0 
action (р < .001). The character 
this interaction is shown in Figure r- 
A simple analysis of variance was p 
ried out at each test period (Linda 
1956, p. 47). The results of these an on 
yses showed no reliable diferenci ine 
the pretest among the groups, and po 
nificant differences among the gto" st. 
on the posttest and the postpostte i- 

he t tests between the means 200- 
cated that on the posttest the 5; tter 
phrase group’s performance was y 
than both the other groups (p z per- 
and the 2,600-phrase group's P 
formance was better than the ost- 
phrase group (p < 01). On the Pyas 
posttest, the 5,200-phrase group s 
again better than the other two £ - in 
(р < 01). However, the differen h ce 
performance between the 2,600-phr^ t 
group and the no-phrase group W25 
Significant. 
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ө 5200 Рһгаѕе бгоџр 


2600 Phrase Group 


а 
Ж 
5 50 
н 
o 
© 
E 
- 
Z 
js pui 
9 No Phrase Group 
Н 45 
8 
March April May 
5 Pretest Posttest Postposttest 
Ia. 1 "е ә А " 
" ‚ Criterion phrase test for training groups at test periods (Experiment ID. 
RT R 
a : à 
te .005). Examination of the mean scores 


Th + 
effec icon io showed a significant 
(р < : aper with training groups 
05), and the Test Period X 


Tainin 
001) 18 Group interaction (p < 
cated a Spection of the means indi- 


Pw performance increase 
El he 5,200-phrase group. A 
Sroups at alysis of variance among 
“8nificant each test period showed a 
Dosttes E ifference on only the post- 
To vorin 8 
a 4 E 001)* the 5,200-phrase 
Chaka cheno 
| їп тенен of the interac- 
e 4, 


von 
See 


tio. 
"Xperi N 
periment II are shown in 


1, S R 
Tu 
Comprehension 


he š 
s ct ane showed a significant 
Theta ы with the Test Period 
5 Group interaction (p < 


indicated that the interaction resulted 
from the 5,200-phrase group's marked 
gain and loss in performance over the 


test periods. 


ISRT Directed Reading 

The analysis showed significant ef- 
fects associated with the Test Period 
x Training Group interaction (p < 
.001). Examination of the mean scores 
indicated that the 5,200-phrase group 
made greater gains over the test pe- 
riods than did the other two groups. 
An analysis of variance at each test 
period showed no reliable differences 
among the group means on the pretest, 
a significant difference on the posttest 
(p< 005), and on the postposttest (p 
« .005). On the posttest, ¢ tests be- 
tween the means showed a significant 
difference only between the 5,200- 
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TABLE 4 


FORMANCE OF 5,200-, 2,600-, AND 
OPERATE Groups ом ISRT SUBTESTS 
(ЕхрЕВІМЕМТ II) 


Е March | April |May post- 
Training groups pretest | posttest | posttest 
Reading rate subtest 
5500; phttan 17.49 | 161.83 | 162.11 
E 28.16 | 25.64 | 24.91 
ӨР 6.11 9.06 9.7; 
2,600 phrase 147.86 | 156.22 | 150.08 
$D 22.64 | 23.33 | 21.70 
on 6.34 8.16 6.81 
No Plens 147.78 | 156.02 | 151.18 
SD 22.54 | 20:08 19.28 
GE 6.33 8.11 7.02 
Comprehension subtest 
5,200 phrase 
M 140.65 | 151.97 | 146.93 
SD 18.70 | 14:96 16.84 
GE 5.53 7.00 6.37 
2,600 phrase 
M 142.61 | 146.68 | 147.34 
SD 17.46 | 20.4 21.66 
GE 5.81 6. 6.42 
No phrase 
143.72 | 149.89 | 150,05 
8р 16.59 17.54 19.85 
Directed reading sub- ч ae nd 
test 
550 phrase 
14.13 | 159.93 | 157.85 
SD 18.02 19.85 18.24 
GE 6.11 8.79 8.03 
200 phrase 
148.37 | 152.07 | 150.57 
SD 20.98 19.78 $3 
GE 6.64 7.11 6.91 
No phrase 
140.00 | 156.20 | 152.39 
SD 16.02 16.75 15.83 
GE 6.40 7.74 7.14 
Paragraph comprehen- 
Bion subtest 
5200 Phraso 
145.82 | 153.14 | 155. 
SD 21.19 19.81 20:15 
GE 6. 7.32 7.41 
2000 phrase 
148.15 | 148.90 | 149.1 
SD 20.01 19:24 | Arg 
Nine t. 6.59 6.72 
-01 | 151.63 | 149.98 
SD 16.59 16.51 д 
бЕ 6.70 7.07 D 


6.80 


Note.—N = 125 per Training group. 


phrase group and 2,600-phrase group 
(p < .01). On the postposttest, ¢ tests 
Showed significant differences favor- 
ing the 5,200-phrase group when com- 
pared to both the 2,600-phrase group 
and no-phrase group (p < 02). The 
difference between the latter groups 
was not reliable. 

The total number of items at- 
tempted on the subtest by Ss in the 
three groups was tabulated to deter- 
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3 
mine whether the 5,200-phrase group's 
gain resulted from increased rate гаш 
or comprehension. The analysis of t! à 
data showed a significant Test y 
riod X Training Group interaction. Р 
< .001). Inspection of the means m 
dicated that the 5,200-phrase am Я 
gain in correct responses from the P X 
test to the postposttest (Table 4) i 
accompanied by an increase in iE 
number of items attempted. Por 
this time period the 5,200-p vm 
group attempted about 15% шо 
items while making about 25% i р 
correct responses when compare 
the other groups. 


ISRT Paragraph Comprehension 


The analysis showed significant е 
fects associated with training ee 
(р « .05), and Test Period X 1). In- 
ing Group interaction (p < 2 iat 
Spection of the means indicated Б, 
the 5,200-phrase group and the when 
phrase group made greater gains oup 
compared with the no-phrase 8! each 
The ¢ tests between the means for t in- 
group from pretest to postposttes jne 
dicated that both phrase-tr p 
groups made significant reading Б БЙР 
(p < 101), while the no-phrase E^ 
made no such gains, Amount 0 h 
appeared to be associated wit 
extent of phrase training. А at- 

The total number of pna the 
tempted on the subtest by 98 deter" 
three groups was tabulated to ading 
mine whether these gains in int in- 
comprehension resulted from M gio 
crease in rate and/or comprehen the 
he analysis of these data Epon e 
groups made comparable gains gest 
number of items attempted see 
Periods (p < .001), but there Wer p 
reliable differences among the £ 
at any of the test periods. 
ee 


* Comparisons between. training pees 
Were not meaningful on this subter 
of marked differences on the pretest. 


8 
тор 
use 


| 
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ISRT Word Meaning 


b zending vocabulary test was 
omes etermine the comparability of 

^ eei among the training groups. 
oes in this instance was that 
been ue on à vocabulary test has 
all vmm to be associated with over- 
ine m ectual functioning. The pre- 
Seri Ores and the main effects for 
— Meaning favored the 2,600- 
en E and no-phrase group 
group (р dum ы the 5,200-phrase 


Exprrment III 


en periment III was designed to de- 
Would b if phrase-reading training 
em € useful with remedial readers, 
gram pe 4 the phrase-reading pro- 
Proaches, other remedial reading ap- 
Wo he investigation differed from the 
The Ss ler studies in several aspects. 
Cultie were children with reading dif- 
a rather than children in the 
Were ao program. The Ss 
avaras in grade placement and on 
leading ba 2% years retarded in 
and ^S ievement. Both parochial 
Cludeq ; le school students were in- 
e гч the current study, and they 
Schoo] к= the junior high and high 
nma as well as the elementary 
le to foi grades. It was not possi- 
further t Ow up the investigation with 
Studies а as was done in the earlier 
eroge: ecause the Ss were from а 
Srade Je neous group of schools and 
Bram M rn The summer reading pro- 
nary Tea T not supported by the ordi- 
the tep, |. 8 requirements available in 
gular school program. 


aterials 


he 
mateg wetirase-reading program was pre- 
in at and th, 16-millimeter projection equip- 
at E Perimet Се reading films described 
the Went ae П. The films were projected 
esent ed (16 frames per second) for 
Study, rather than at sound 
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speed as in the earlier studies. In this man- 
ner children with reading difficulties would 
have more time (214") between phrase pres- 
entations, with slightly longer to respond to 
the phrase as it appeared on the screen 
(3497)? 

А Students in the no-phrase remedial read- 
ing program reviewed structure analysis, 
base words, prefixes, suffixes, compound 
words, syllabication, dictional skills, vocab- 
ulary and compound words, and phonetic 
and content skills. Oral and silent reading 
were required throughout the program, with 
reading comprehension and speed reading 
exercises. Reading materials and workbooks 
were provided for students at their level of 
achievement. The materials used included 
the Reading Laboratory Kits by Science 
Research Associates; How to Become a 
Better Reader by Paul Witty; Reading 
Workbook by Charles Merrill; and Pho- 
netics Workbooks by Lyons and Carnahan. 
Supplementary readings including Pleasant 
Lands, Across the Blue Ridge, Engine 
Whistles, Trails to Treasure, Parades, etc. 


Procedures 
Subjects. The Ss were public- and paro- 
chial-school students attending the sum- 
mer remedial reading program offered by the 
Des Moines Public Schools. The program 
was open to all students in Polk County and 
an enrollment fee of $8.00 was assessed to 
defray part of the costs. Attendance was on 
a voluntary basis. 
Grade placement of the students in the 
study ranged from sixth grade through 
tenth grade, except for one eleventh grader 
and one twelfth grader. The average grade 
placement was 83. Fifty-two percent of the 
students were reading 2¥2 years or more be- 
low grade placement, with an average read- 
ing level of 58. In addition, 19% of these 


"This procedural change was, made be- 
cause the author believed that children with 
reading problems might have more difficulty 
making as rapid а discriminating reading re- 
sponse as students in the regular reading 
programs. Under these practice conditions Ss 
had more time to perceive the phrase on 

hrase would less 


the screen and the next p 
the perceptual process. 


likely interfere in ре 
"The school administered the Gates 
Reading Survey on the first day of summer 


school. Reading achievement. level was de- 
termined for each S by combining the three 
subtests (Speed and Accuracy, Reading Vo- 
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ы ave repeated ai 
ay pia poop d e reading level 
east oni А < 19 to 
in grade placement from 
"fw pede ИЕ disability pees de- 
tenned by comparing each S's ep p m 
ment in school with his level of rea: ing 
achievement. In this comparison $s range 
in relative reading ability from over eg 
years retarded to three years above ТИ 
ancy. Several students in the program had 
certain reading difficulties even though they 
were able to read at grade placement or 
рь Two experimental training groups 
were established to compare 8s in the phrase- 
reading training program with Ss in the 
usual remedial reading program. Three 
schools were included from different. social- 
economic areas. At each school one instruc- 
tor taught four reading classes. Two train- 
ing groups were established within each 
school by random assignment, so that е 
instructor had two classes in each group. In 
this manner each training group was made 
up of six different classes representing three 
different school areas, and each instructor 
taught students in each method of training. 

Students who remained in the no-phrase 
remedial reading program had instruction 
Over a wide range of materials and concepts, 
and used several established reading ap- 
proaches. The major goal in this program 
was to provide meaningful reading experi- 
ences for each individual student. 

Students assigned to the remedial phrase 
program had phrase-reading training as part 
of their daily reading program. The teachers 
were instructed to 


show the ten phrase-read- 
ing films at least twice during the course of 


the experiment to establish a minimum 
training program of 5200 phrases, Under 
these instructions, teachers could use phrase 
training in addition to other remedial read- 
ing procedures. 

The summer reading program covered 33 
daily class meetings Over а seven week pe- 
riod. Each class meeting was 64 minutes long 
and Ss in both training conditions spent 
the same amount of time in their respective 
programs. | 

Al Ss were pretested with the Iowa 
Silent Reading Test (ISRT) Form AM, 
Elementary Edition (Subtests 2-Directed 
Reading, and 4-Paragraph Com 


prehension), 
at the end of the first two weeks of the reg- 


ach 


cabulary, and Level of Comprehension) in 
the manner suggested in the manual. 
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ing the 
ular remedial reading program. wp sie 
remainder of the program, half о 1 program 
mained in the regular remedia D ing 
while the other half had the mM ed 
as part of their program. IS Sine 
administered at the end of the qoem of 
Conditions of posttesting ren t both 
pretesting and were similar for 5s 
raining conditions. h film 
à Two of the teachers showed а pios 
twice to the phrase classes for a маше was 
gram of 5,200 phrases. This prp How- 
comparable to the previous а dms the 
ever, the third teacher decided = шу у. 
phrase training program almost exc s about 
As a result he showed the film serie phrase 
three and one-half times with ue 0, 
classes, for a training program 2 analy- 
phrases. For purposes of the sabe pool 
sis, the phrase-reading classes ae ing 
and the average amount of phras 


to des 
experiences of 6,500 phrases was x 
ignate Ss in this training conditio he training 
The Ss were distributed in t 


F e-reat 
groups as follows: the remedial geet 1 
ing group, 67; and the d t 
reading group, 61. One hundr ith 
Ss were included in this study, à pr 
each training group to establish t ical 
lionality necessary for the acre proces? 
poses. The eight Ss eliminated in ts. Three 
were all absent during the реше absen 
Ss included in the final selection bis absent 
during the pretests and five Ss a SB. ij 
during the posttests. In these was use 
mean score for their training CONE оле on ® 
to replace the missing individual = 
particular test, at T 
The Ss within each of the training # ver 
Were assigned to one of three JEN isal Шу" 
ased on their relative reading a py tS 
Three ability levels were соп к тосу. 
method and were designated as e ling d 
ate, and severe with respect to ге 
ficulty. A 
In comparing the two ehar 
groups, it ts noted that the resulting od aid 
assignment of classes to training Es remedy 
not come out exactly parallel. er m 
phrase-reading group had an ave e adin? 
Placement of 8.7 with an aeon expect 
disability of 3.1 years below Braet ig gros 
tion. The no-phrase remedial dt 0, Ww! Je 
had an average grade placement © ow Е! Sa 
an average reading disability 1-8 roup ЧАР 
expectation. Thus, the phrase La pa 
composed of slightly more adv placem? ^, 
dents in terms of school grade yous! F 
among which were the more 56 
tarded readers. 


орог" 
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RESULTS 


A 
tater E analysis of variance 
four se indquist, 1956) was used on 
ures я sets of criterion meas- 
the total uding the standard scores and 
the шыр of responses made on 
graph C irected Reading and Para- 
two bet omprehension subtests. The 
"enc: factors common to all 
(6,500- LEGEN of training groups 
group) pres group and no-phrase 
erate du ability levels (mild, mod- 
ing debia with respect to read- 
mon to i y). The within factor com- 
Periods M analyses consisted of test 

sh etest and posttest). 
marized į of the findings can be sum- 
or readi mmediately. The main effects 
all four ng levels were significant on 
and in arterion measures (p < .001), 
amount m ease the Ss with the least 
he highe eese disability achieved 
effects hs reading scores. The main 
Cant liar periods showed signifi- 
eading I on the two Directed 
005: are (standard scores, Р 
but О or ems attempted, p < .001), 
lon ДЕА the Paragraph Comprehen- 
Thain off yses. There were significant 
ing the e for training groups favor- 
irecteq. DW phrase groups on the two 
Scores eading analyses (standard 
< 01) а :005; items attempted, р 
"lods x R he interactions of Test Pe- 
Ported нар Levels will not be re- 
Cross tr ce these scores are collapsed 
Teason aining groups, and, for this 
; are not of interest. 


ISR . 
Т Directed Reading 


Th 
Showe, eoe of the standard scores 
ost ae cep ori Training Group 
le char riod interaction (p < .05). 
Shown jn Anter of this interaction is 
cang h 5. Examination of the 
esulteq oe that the interaction 
arked ү om the 6,500-phrase group’s 
eading gain when compared 


TABLE 5 
Perrormance ОЕ 6,500- anD № 
A ,900- AN o-PHRA: 
Groups on ISRT DIRECTED Paomia 
Susresr (Expentuent III) 


Training groups Pretest Posttest 


6,500 phrase 
M 149.57 155.92 
GE 6.76 7.69 
No-phrase 
M 144.15 145.02 
GE 6.12 6.20 


Note.—N = 60 per Training group. 


with Ss who received the remedial 
reading program without supplemen- 
tary phrase reading training. 

The total number of test items at- 
tempted were tabulated for all Ss to 
determine whether the 6,500-phrase 
group’s gain resulted from an increase 
in rate and/or comprehension. The 
analysis showed a trend associated 
with the Training Group X Test Pe- 
riod interaction. The phrase group 
made about 12.5% more correct re- 


sponses from pretest to posttest than 
did the no-phrase group, while at the 
same time attempting only 7.5% more 
items. (All Ss made correct responses 


on about 70% of the items attempted.) 


ISRT Paragraph Comprehension 


Training Group X 
action, similar to t 
Directed Reading su 
the differences between 
not reach significance. There were no 
accompanying changes in reading rate 
as estimated by the number of items 


attempted.? 
ии 
means: 6,500 рһгазе 


в Standard score 
group, pretest 142.42, posttest 146.98; no- 
phrase group, 143.83 and 144.16, respectively ; 


(p € 10). The trend favoring the phrase- 
reading-trained s on Paragraph Compre- 
hension was in the same direction as the 


Discussion 


In Experiment I, the tachistoscopi- 
cally presented, meaningful word 
phrases helped intermediate grade- 
School children improve their reading 
comprehension. After the experimenta] 
training period, students assigned to 
the 5,200-phrase training program 
were reading significantly better than 
their matched counterparts in the 420- 
phrase training program, The magni- 


graph Comprehension Subtest 3 
months after training, but comparable 
findings were evidenced on the ITBS 
Reading Testo 

To determine whether the 
comprehension might b 


experimental groups, about 7%, There- 
fore, this would not likely account for 
the differences in reading comprehen- 


significant findings reported in the earlier 
two studies, 
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sion. These results suggest that there 
Was an actual gain in reading com- 
Prehension independent of changes in 
reading rate, эй іа 

Experiment II was designe S 
cross-validate the findings of eo 
ment I. In Experiment II, the 5,200- 
Phrase group did significantly ы 
9n both post phrase-reading tests t a 
either the 2,600-phrase group or s 
phrase group. The changes in phra 5 
Teading for Ss in the 01200: rd 
training program were favorably & 
sociated with gains in reading rate kn d 
Teading comprehension as indica a 
by the reading achievement test i 
sults. The findings were less consistent 
for students in the 2,600-phrase tr м4 
ing program, The difference in the nis 
Sulis between the two experime em 
may be due in part to the chang 
made on the criterion phrase tests p^ 
in part to the differential eg a 
of the revised 5200 Phrase Film Ser 
used in the Second investigation. hrase 

In Experiment II the 5,200-p^ ars 
group made more than 2⁄4 rar 
greater gain in reading on the her 
Subtest when compared to 0 ar 
groups. The reading rate ри E 
Were established without loss of re 2 
ing comprehension, for there aed 
differences among groups on the ame 
Prehension Subtest over the rei 
materials, The 5,200 - phrase - tral ain 
Stoup made over a year greater A Я 
in reading оп the Directed prs 
Subtest when compared to the ied 
training groups. Differences poe 
the 2,600-phrase group and по-р ding 
Stoup were not, reliable, The d 
gains for the 5,200-phrase group 5 0 
Accompanied by significant ite а 
the number of items attempted 
жс a 

"Since the 5,200-phrase group had aning 
cabulary disadvantage on the word mean? 
test, their superior reading gains Wo" + of 
ikely be due to some chance assignm 


brighter students to their group. 
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ео 
TA mde to the other groups, in- 
Scores : eirca in the reading 
reading rate, differential changes in 
more qo 200-phrase group also made 
M (бае а year gain in reading in 
tes whe Ph Comprehension Sub- 
аб до] Пе students in the regular 
improve i made no significant 
p meti Reading gains for the 
associat d groups were favorably 
as indies with the amount of practice 
{ tees by a comparison of the 
Phrase ры, group with the 2,600- 
i Alii a There were no reliable 
e oia] | among training groups in 
9n the Pa number of items attempted 
est. А, Tagraph Comprehension Sub- 
the hess arently, the greater gains for 
increase Se-trained groups were due to 
and not. In reading comprehension 
The findi to changes in reading rate. 
ere со ings of the 5,200-phrase group 
rst inp to the findings of the 
ments I estigation. In both Experi- 
Í the rs II the reading progress 
Sistent, fo; 00-phrase group was con- 
readers ы low, average, and superior 
follow.. and was maintained during 
xp кыз test procedures. 
tigate CU III was conducted to in- 
dining the use of phrase-reading 
remedia] with students in a summer 
ngs su reading program. The find- 
struction est that phrase-reading in- 
method c tie a useful supplementary 
for sty 9! remedial reading education, 
reat, ents in this program showed 
Me i on on the criterion reading 
Year неа They made over % of a 
he su Te gain on the Directed Read- 
tay, eg than did the students 
“edureg ith the usual remedial pro- 
Paragra here was also a trend on the 
Voring 4P4 Comprehension subtest fa- 
Phrasen e Ss who received remedial 
eg we training. These find- 
Previo Te consistent with the two 
8 studies. This study was а 
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somewhat more rigorous test of the 
usefulness of phrase-reading training, 
however, since the phrase-reading pro- 
gram was compared directly with al- 
ready established reading approaches. 

The Phrase Reading Film Series— 
Intermediate Grades was developed to 
provide extensive practice in the rapid 
reading of meaningful phrases. The 
phrase-training program was designed 
to increase the accuracy, speed, and 
span of recognition of reading ma- 
terials, The program provided a highly 
structured learning situation which 
reduced distracting stimuli during 
training and provided immediate in- 
formation on phrase-reading progress. 
Motivation for reading was increased 
in the novel learning situation. The 
results of the research indicated that 
this technique of phrase-reading prac- 
tice did significantly enhance reading 
performance for intermediate-grade 
children and remedial readers at all 
reading levels. The reading gains were 
maintained in follow-up testing pro- 
cedures after the phrase-training pro- 


gram was completed. 
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DESCRIPTION OF JUNIOR COLLEGES 


JAMES M 
Db: 
— S; JR.,! LORRAINE M. RAND, ax» LEONARD Р. RAND 
American College Testing Program, Iowa City, Iowa КЕ 


In a po i i 
TS purs men of 581 accredited junior colleges, measures of 36 ma- 
s were intercorrelated. With unity in the diagonal, a 
Й 


principa i 5 
pal components analysis was carried out, extractin: 
xtra g 


12 factors 


with envi 
rotatad hire gd greater than 1.00. The first 6 of these factors were 
a final solution through the Varimax procedure. The 6 ro- 


tated fac " i 

i a were titled: Cultural Affluence, Technological 

The | ‚ Size, Age, Transfer Emphasis, 
junior college factors are not congruent with 


colleges. 


à Special- 
and Business Orientation. 
factors for 4-year 


In the past 7 
leseribing ine 7 years, several ways of 
— istitutions of higher educa- 
hare a tried. Pace and Stern 
haracteri á developed the College 
alse iny stics Index (CCI), a true- 
eatures ee which measures 30 
Of the ms the environmental “press” 
haye da. Astin and Holland (1961) 
“sessment, p: the Environmental 
tempts t Technique (EAT), which 
erms of O assess the environment in 
Student p SENS. characteristics of the 
ence, mda : its size, average intelli- 
Realistic. T; "personal orientations" 
ntional 7s Intellectual, Social, Con- 
ased on и tain and Artistie— 
Cach of six е proportion of students in 
BAT wg eum of major field. These 
i га Subst es were found to account 
ғ; с ge amount of variance 
i5 € and later they were 
S as 9 ict the “effects” of the 
Mie Я 1963) роне by the student 
А ribe talle . Still another way to de- 
«Узі of ge environments is factor 
p "acteristicn “Ce measures of college 
лашу, c n (Astin, 1962, 1965b). 
en ы. lege environments have 
ial s Wed simply a f 
ties imuli, or « dà S a set о poten- 
ch of the о, 1 observable characteris- 
anging a ege that are capable of 
= he sensory input to the 
Ry: he 
Sees tea are indebted to Max R. 
Serine or о алй H. Bradley 
Я ical reading of the manu- 
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student attending the college" (Astin 


1965a). 


Previous studies of college environ- 


ments, however, have been restricted 
to four-year colleges granting the bac- 
calaureate degree. The nearly 600 ac- 
credited junior colleges in the United 
States for the most part have been 
neglected, although there have been 
general treatments of junior colleges 
(Blocker, Plummer, & Richardson, 
1965; Medsker, 1960), sociological 
studies of single junior colleges (Clark, 
1960), and studies of the articulation 
between two- and four-year colleges 
(Knoell & Medsker, 1964). 

The basic purpose of the present 
study is to develop a description of 
junior college environments by or- 
ganizing the information currently 


available into а brief profile. Such a 
brief profile can be used both to char- 
acterize individual junior colleges, and 
in subsequent research to study the ef- 
fects of colleges on students more effi- 


ciently. The basic technique is a factor 
analysis of 36 measures of junior col- 
lege characteristics. This study, there- 
fore, is largely а replication in à popu- 
lation of junior colleges of Astin's 
(1962) study of four-year colleges. 


PROCEDURE 


ior Colleges 


colleges consisted of 
This 


Population of Jun 


The group of junior 
581 accredited, two-year colleges. 
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i П junior colleges for which 
арава n uon y via cho 
3),* wit! e ex j 
Perd alee 0 s aon al for the train- 
E P riests, members of religious orders, 
ps “phe sole restriction for inclusion in 
pod ead Junior Colleges is that the college 
be reeognized by regional or state — 
ing agencies. Therefore, the group of e 
leges studied should. be considered the 
population of accredited junior colleges, 
rather than a sample of some population. 


Measures of J'unior College Characteris- 
tics 

Thirty-six institutional variables were 
selected for study. The choice of variables 
had two primary aims: First, to include at 
least some data for all methods which are 
currently used in characterizing institu- 
tions, and, second, to include as many as 
possible of the variables Astin (1962) used 
in his study of four-year colleges (and to use 
the same scoring system he used). Unless 
stated otherwise, the information about 
junior college characteristics was obtained 
from American Junior Colleges (Gleazer, 
1963). In most cases the information in this 
compendium was reported by each junior 
college for the academic year 1961-62. The 
number of colleges for which data were 
available is indicated after each variable, 

Type characteristics. Among the most 
commonly used ways of classifying colleges 
are type of control and curricular emphasis. 
The following five measures of these charac- 
teristics were included in this study; 

1. Private versus Public Control—Publie 
score 0; private score 1. (ү = 581.) 

2. Degree of Religious Control—Non- 
denominational score 0, Protestant Score 1, 
Catholic score 2. This scoring system is the 
same as the one Astin (1962) used in his 
study of four-year colleges. (N = 581.) 

3. Liberal Arts Emphasis—No liberal 
arts curriculum 0, liberal arts plus other 
curricula 1, liberal arts curriculum only 2. 
(N = 574.) am 

4. "Teacher Training Emphasis—No 
teacher training 0, teacher training plus 


other training 1, teacher training only 2. 
(N = 574) 


* It should be noted that more than 200 
junior colleges have been established since 
the publishing of this compendium. At 
the present time—1966—no comprehensive 


source of information about these colleges 
is available. 
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3 is—No 
5. Technical Training кардашы in 
technical school training 0, pe ой 
ing plus other o 1, techni 
. (N = 574. . và 
u^ characteristics. Measures of ir 
financial characteristics were inc due simi: 
order to eliminate any correlation е al 
ply to differing sizes of junior ms tuition 
financial characteristics | ур e's total 
were divided by the junior ah pene on 
enrollment, thus a 
“ rs per studen н NS 
* ртты publie a M 
resident fees were used. (N = 53 S iN 
7. Endowment—Estimated ma К 
= 509. endi- 
к" ioris Budget Annua) Mo 
tures for educational and general P в 
= EDS 
u^ Capital Income—Gitts and арргор 
tions for capital purposes. (N plecti of 
10. Scholarship Funds— (N = 520.) 
money available for scholarships. following 
Student characteristics. The dent bo 
fourteen characteristics of the stu! 
were assessed as follows: — . 
1l. Percentage of Males in 
Body. (N = 571.) 
12. ад of Duae pi in 
in the Student Body. (N = 50 Students i 
18. Percentage of Foreign 
the Student Body. (N = 578.) ЕР 
14. Percentage of Part-Time 
the Student Body. (N — is MS 
15. Percentage of Students is 
or more of their College Expen " 
386.) der to о, 
16. Total Enrollment—In ort aa e 80020 
а more nearly normal distribution: ор t e 
оп this variable is the square tog, (N 
total number of students enro e" 
579. used 0f 
D. Aptitude Level—The agoe osit? 
this variable was the average atiné T " 
score on the American College f аррИ0%рд- 
fram’s national test, battery °° year 10 A 
to each college in the sender indicate 6 
63. Unpublished ACT researc c comi 08! 
correlation of .96 between viet соро ol- 
Scores of applicants and enn ente? | ost 
Scores of freshmen who actu& typi al KE 
leges. The ACT test battery liabilities fg- 
of academic potential, with 2n of the ACT 
validities against, grade oritare ests © 
nitude to be expected for Sue 314.) | e of 
Technical Report, 1965). (N T percenttÉ у, 
18. Realistic Orientation— 4, for? 
Students studying agriculture, 
engineering, etc. (N = 497) Берсе 
19. Intellectual Orientation 


non- 


alue. 


the Student 


udents 


udenté in 


£ galf 


e 
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9f students s i 

phie ents studying science, mathematics. 
бе орћу, etc. (N = 497.) | 

fig ache s Orientation—Percentage study- 
hi ш nursing, etc. (V = 497.) 

ке Pere qnm Orientation—Percent- 

N = 496) E accounting, secretarial, etc. 

n ici es d Orientation—Percent- 

d E political science, pre-law, busi- 

497 ) nistration, marketing, ete. (N = 
23, isti i 

tudse a io Orientation—Percentage 

496,) rt, music, journalism, ete. (V = 


Vari 
Sorbent 18-23 compose the heart of the 
tin & EGAL Assessment, Technique (As- 
38 to the and, 1961). There is some doubt 
Variables į appropriateness of using these 
an ipsatiy, n a factor analysis, since there is 
theless Ы relationship among them. Never- 
Cause the ey were included in this study be- 
n order A tap important information and 
Astin’s fine as nearly as possible 
Astin’s reg d study of four-year colleges. 
meaningful. eu addition, were clear and 
Те not, ge; which suggests that the results 
Scoring of пошу affected by the ipsative 
pudy, the EA EAT variables. In the present 
olds which АТ variables are based only on 
Урев. Stud clearly belonged in one of the 
eral arts” Mid. in an undifferentiated “lib- 
ша were not considered. 
aero Py. воре on this variable 
m DAT ов between the highest and 
ones) coll ariable. High scoring (homo- 
(hey опе c eges tend to have students in 
roj] Bene urriculum, while low scoring 
led in a none) colleges have students en- 
aculty oh de variety of fields. (N = 499.) 
+ Perce; aracleristics. These included: 
Ctoral ntage of Faculty Holding а 
igi Ване an Geek 
‘Asters тшн of Faculty Holding а 
Whi ariables 25 o; W = 577.) 
bis the ia vem 26 concern the extent to 
ap tlaurente dey has training beyond the 
Col] One-third 4f d On an overall basis, 
degr, © faculti of the members of junior 
fra е. Scores es have only a bachelor’s 
ls, B Ae these two variables differ 
Bes in p- (1962) stud; le 
Ong 1 that, in thi udy of four-year co 
time © total i this study, they were based 
e. aculty, both full-time and part- 


27 
ties Per 
[a ES of Faculty which is Full- 
fulg Facu] Б 
tin time fac Student Ratio—Number of 
€ students y divided by number of full- 
‚(М = 875.) 
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Miscellaneous characteristics. Included 


here are: 

29. Library Size—Number of books i 
the library. (N = 570.) аа 

30. Relative Library Size—Number of 
рле divided by the total enrollment. (N = 
31. Variety of Curriculum—Total num- 
per of different fields of study offered. (N = 

75.) 

32. Percentage of Graduates Going On to 
Four-Year Colleges. (N = 467.) 
. 33. Growth Rate—Percentage of increase 
in enrollment between 1958 and 1962. (N — 


476.) 

34. California Location—Colleges lo- 
cated in California 1, other colleges 0. This 
variable was included because Califor- 
nia has the most extensive junior college 
system in the country, ‘and we wanted to in- 
vestigate whether this system has any spe- 
cial characteristics which distinguish it from 
other junior colleges. (N = 581.) 

35. Age of Institution—Colleges founded 
since 1954 scored 0, colleges founded be- 
tween 1945 and 1954 scored 1, colleges 
founded between 1930 and 1944 scored 2, 
and colleges founded before 1930 scored 3. 
(N = 581.) 

36. Placement Service—Colleges having 
a placement service scored 1, other colleges 


scored 0. (N = 581.) 


METHOD 


tions were com- 


Product moment correla В 
3 Since not all 


puted among the 36 variables. 
scores were available for all colleges, & pro- 
gram which allows for missing data was used. 
Thus correlations only on those 


are based onl} s 
colleges for which data were available. This 


bias the correlations in un- 
or example, itis possible that 
s for which ACT test scores 
uite different from the 


sultant ion matrix was 
NR. by inci components 
method based on eigenvalues and eigenvec- 
tors with unity in the diagonal and extrac- 
tors wie an eigen Edd cem 

ig procedure, includini 

а m bis diagonal, js Harris’s (1964) 
‘actor analysis and it follows the 
presented by Kaiser (1960). A 
his procedure is that it 


major & vantage of t 


a 

3 All computa’ 
carried out at the О 
puter Center. 


tions for this study were 
niversity of Utah Com- 
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uces factors which are linear combina- 
ре of the observable variables, thus mak- 
ing it possible to compute factor scores 
(Kaiser, 1965). А . 

Twelve factors with an eigenvalue greater 
than 1.00 were extracted. However, several 
considerations—including an unsatisfactory 
preliminary rotation of all twelve factors, a 
comparison of communalities after extrac- 
tion of each factor with the highest correla- 
tion for each variable, and a plot of the 
eigenvalues—suggested that only half of 
these twelve factors should be included in 
the factor rotation. Accordingly, the first 
six factors were rotated to a final solution 
by the Varimax procedure (Kaiser, 1958). 
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RESULTS . 
Some of the variables involved in 
this study were highly skewed. For the 
most part, these variables are those "à 
Which a relatively large number of E? » 
leges had a score of zero. For qup 
many junior colleges (especially pub F 
colleges) have no endowment. AS z 
result the distribution of endowment | 
quite skewed. In such cases, of cours! i 
there is no transformation which i | 
eliminate the skewness. The ber $i 
tions among the various junior colleg 


TABLE 1 
Unroraren Factor Marnix, EIGENVALUES, AND COMMUNALITIES 
Factors ht 
Variable 
жш шжу Рт еер ев рее, 
ілш [у |v уп |уш[ ох | x | x | РЕГ 
1. Privata (versus public) 18 
E NN 7%) n -02|—o5| — RI 12 
$ Готе religious control | 58| 23 à 5 PR | 21-0 6 
4 Teacher training empha.| 10] 48—28) 27] -H|-H 06} 33] 24 18 
m 1 ТЮ) 58| ~16 | —o7 | -2| _ 66 
е ТЕТТЕ ИЧЕ" E 
1. Endowment/student 25 | d 5| i ER. Zo 7 ө 
Я Operating budget /stu- —29 | —3! Г 
E arr 57] 0 31] ~12 | ~o9 | — T 
9. Capital inco Е 0090—22) 13| оз | 19 
10: Seholarsbip шы. 7-18) 02 | Zis | Z9 0| 38| 06 | 06 65 
ц. Percentage of matey | 22 -a| 82] 98] -32] 19] -20] 25] a m 
12, Percentage of out-of-state OS} -39 | -16| o5 | =30 | | oi т 
student 
13. Percentage of foreign stu- 01 20} m o% O1)—24) 15| os] от 63 
еп 
И. Percontage of parttime) | 15) 3] 05! —a3| „| |] a и 
Students - - 
15, Percentage of students! 79 | 03] 14| asfi] a7] o 06 | —12 и 
Suing 33 of expenses | —41 | gp | _og 33 P 
16. Total enrollment 29 | 26) Go] jg} 3| 12) 33|—i8| о E 
11. Aptitude level of studenta| —13 | 28 30 B|-9|-04| 14| o 34 
18. Realistic orientation =38 | —62 | 24 | 05 | —37 | —30| 47 05} 31 57 
19. Intellectual orientation | —27 32 | -04 | ~22| -17] 00 | —10 18| 09 80 
20. Social orientation i3 | 55| 221| 93] 08) 16) Z3 | 38] 00 80 
21. Conventional orientation| 99 |. 0i 07 | T34 -3| 00| 25 | Z14 | Zio 63 
22. Enterprising orientation| —02 | L0} =o: #3) 9| 28] || 3 67 
23. Artistic orientation 13) 28| 22| $3) ~15 | ~50 | 33 | o| —34 
24. Homogeneity of enyiron- Б 10) 10] |7 —15 | —35 68 
ment 39 | — 
25, Percentage of faculty with S SE eds | dal ш а op 19 Т 
03 
26, Poreontage of faculty with иа К 3| o 66 
faster’s —08 - 
27. Percentage of faculty 48 | =53 | —10 | 09 | gg =01 | —02 | 20 65 
which is full-time — 23| 19| 1 —32 18 
28. Faculty-student ratio 29) -n | 20| 02| _34| -18 | -18 | о | от [n 
29. Library size —— -B| 48) 54| o|] 40| 13] 247 74 
30. Relative library size 53^ 16| j9|.jg| li|—-03| -o1 | 28| 07 7" 
31. Variety of curriculum | 60| 10 20 | Zia | 2| 44] 04|] 12|-—18 
32. Percentage of graduates 95| 02 | —o2 | —15 | =07 af 
going to 4-year colleges —06 42 | — Я 
33. Growth rute — = 9 18| 07| ав] 44| о й 
34. California location —4| 26| 54| mn 05 | 09 | —28 | —02 | —07 @ 
35. Аде of college - 33| 39 m|—is| 93) 16| 07| 18) ов 5 
30. Placement service -37| —07 29 13 02 d zm 2] 3 
Eigenvalue 5.70 | 3.44 | 2.39 | 2.08 1.70 | 1.56 | 1.52 | 1.32 | 1.29 
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TABLE 2 
Variax Rotation or First Six FACTORS 
A p^ с р" ES F 
Variable Techn gud ht 
|siness 
Cultural | logical s, Transfer isnt&- 
affluence speciali- Size Ase | emphasis | fon 
zat 
1. Private 23 63 
2 (versus publi trol 47 -4 37 E 
3. Libera] Of religious ен 4| -3 23 u | - 8 
4, Тое arts emphasis 06 -37 -% 49 ar i 
5 тозот training emphasis —05 24 08 ss j @ B 
б. Tuitinological emphasis —28 67 -09 02 ү: F1 
Тооп —01 -17 21 —30 T 5 
8. Ope o ment/atudent, 42 —05 00 08 a a 
9: Сарп budget/student 36 -00 5 | ol i E 
10. борца! income/student. 32 26 —03 -ü 0 p 
11, perelatship funds/student 48 —05 -0 9r [ ыу 5l 
12. percentage of males —21 64 —16 zu 1 m 
13; percentage of out-of-state students 39 EL E 8 07 43 
14. Percentage of foreign students 64 —0ô 06 in —02 09 
15; Percentage of part-time students EU 08 —64 
ехрє (аде of students earning !4 of 4 06 38 
16. Т Penses 00 09 20 —56 1 03 26 
17. Ape enrollment —22 09 83 -18 =a 5 4 
18. Бош Че level of students 12 30 23 03 E: E- E 
19. Intel tie orientation = 73 -02 —09 27 219 21 
2 Бос сш (otientation E: =ф pa = 67 02 5 
Я ientati 2 1 = = —26 
22. pohventional orientation 2 1-8 | 8 | 6 Г 57 | 38 
23. Аты rPrising orientation -17 -10 =й 21 00 08 22 
24. Ho istic orientation 16 -22 31 2 56 =й 55 
25, peMoxencity of environment 21 08 40 | 6 05 53 33 
20. percentage of faculty with PhD 14 00 C 07 65 —09 54 
27. Percentage of faculty with Masters | —27 | —H | 77 g 
ag, time ge of faculty which is full- i ов 06 60 zB E Е 
29, pielty-student rati 7| -49|-9| | w 08 58 
30. оташу size 9 11 —07 97 n 08 -2) 58 
$1. Уау library size a | = | 7R 0 й | -ю | 9 
3E. Percent of, curriculum -n n o и © 
8. year colleges graduates going to 4} — —08 -% =н A Е) 3 
4. Growth r =05 = = d -07 
às. California location E. 08 5-9 2 -11 52 
38; pre of college 09 03 13 E || 218 13 25 
“cement, service E 14 37 =й 
М 
Reflected factor, 
5 TABLE 3 COLLEGES 
IM R-YEAR 
Ашту Between Facrors кон Junior COLLEGES ANP ror Fou 
Junior college factors 
4-y Transfer Business 
"Year coll Techno- Ti : orienta- 
ege factors Gad logical Size Age emphasis tion 
eciali- 
affluence вр tion — 
А! ag | —.1304 .4580 
Renice soe; | 0108] -0638 2988 | 6135 | — 1808 
y alig s " „оде > . —. 
Size 1€ orientation —.1971 .6639 7236 71008 .0815 .0654 
азо]; —.0865 .3287 "1195 | —.3507 —.9957 | —.2998 
Hon Ulinity — .3888 ‚3677 o "0277 | — 4278 | — 1780 
PupjrEeneity 11394 | -2877 — E “9302 | —.1921 | .3542 
V8. private control “4410 | —-4806 | 7 
Not in (1962) 
е... by Astin (1962). 
4-year college factors obtained from study by 
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д teristics were computed* and 
са to factor analysis. The un- 
rotated matrix of the twelve factors 
with an eigenvalue greater than 1.00 is 
presented in Table 1. The factor solu- 
tion for the Varimax rotation of the 
first six factors is given in Table 2. 
Finally, in order to compare the 
factors obtained in this study with the 
factors Astin (1962) obtained for four- 
year colleges, the Coefficient of Con- 
gruence (Tucker, 1951) was computed 
between each rotated factor for this 
study and each of Astin’s rotated fac- 
tors. These calculations involved only 
the variables common to the two stud- 
ies. Results are shown in Table 3, with 
Astin’s factors rearranged so that, to 
the extent possible, highest, Coefficients 
of Congruence are in the diagonal. 


Discussion 


The rotated factors are briefly de- 
scribed and interpreted below: 


Factor A 


The variables with high loadings on 
this factor describe college which has 
a large number of library books per 
Student, relatively many foreign and 
out-of-state Students, and many faculty 
members relative to the number of 
students. It is privately or religiously 
controlled, and is relatively wel] 
financed. This pattern looks like the 
factor named Affluence by Astin (1962) 
in his study of four-year colleges. In 


* Tables showing Means 
tions, and Numbers of | \ 
Junior College Characteristics and Correla- 
tions among Junior College Characteristics 
have been deposited with the American, 
Documentation Institute. Order Document 
No. 8921 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library 
of Congress, Washington, D. ‚©. 20540. 
Remit in advance $1.25 for microfilm or 
$1.25 for photocopies and make checks pay- 
able to: Chief, Photoduplication Service, 
Library of Congress. 


; Standard Devia- 
Observations for 


the present study, however, x eem 
appears to involve facilities, suc re 
library and the faculty, more xi 
financial wealth. An appropriate a 
therefore, would be Cultural A fluence. 


Factor B 


Loadings on this factor get m 
college with a technological P pe 
and many students in technical Le 
grams, with many male pera e de ia 
few students studying such fie d. 
education and secretarial SE ma 
with few out-of-state students. indito 
publie school which does not emp id be 
the liberal arts. A good title wou 
Technological Specialization. 


Factor C 


H on 
Colleges that would score coat 
this factor have large enrollm Jum; 
a varied, heterogeneous вални 
many part-time students; and fe p this 
ment service. The best title е 
pattern would probably be — pe 
college scoring high would pro 5 apre 
an urban-centered, open-door, cO i 
hensive college, with a strong dition 
on continuing education. In ing col 
one would expect the high scorin mper- 
lege to be characterized by an i con- 
sonal atmosphere, few € culty» 
tacts between students and 


4 t su 
Several highly organized studens 


Factor D pich js 
Loadings represent a qe ents 
old, Which has faculty and stus fe 
Who are both full-time, whic iy many 
working students but relative 
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out- 

Seber students, which has not 
ne which spends relatively much 
hemes jos student, and which is a 
would сом An appropriate title 
wodi Age. The high-scoring college 
n erae d resemble a small, 
likely E iberal arts college. It would 
dential ave many traditions, & resi- 
as student body, and an adminis- 
acting tig conceived of its role as 
would 2: осо parentis. Such a college 
" уйре ын be likely to have a selective 
sarily T policy, although not neces- 
aptitude нр emphasizes academic 
fore, wen An alternative title, there- 
T ght be Traditional Exclusive- 


Factor E 


C 
Senne characterized by the varia- 
Size tea a high on this factor empha- 
and offer 4 i training and liberal arts 

hey M heterogeneous environment. 
Such fields many students studying 
ates who as education, many gradu- 
and ma, go on to four-year colleges, 
ter’s сае faculty members with mas- 
o net of fh A common denominator 
Ment for В ese variables is а require- 
Junior coll urther education beyond 
Braduates Es and, accordingly, many 

baned of high-scoring colleges seek 
this fact, training. The best title for 
Emphasis, would probably be Transfer 
‚Ош 
ing len Оша expect the colleges scor- 
Colleges on this factor to be terminal 
tical vo primarily concerned with prac- 
Cort арыт training, making little 
What ha аал their curriculum on 
Colleges T een traditional for four-year 
© conce he high-scoring college would 
Song PEDE more with preprofes- 
р. » exploratory training. 

actor р 

The h; 
tor "ie high-scoring college on this fac- 


as r 4 
fields аад many students 1D 
aracterized as Enterprising, 
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relatively many faculty members wit! 

Ph.Ds, high tuition, bright D 
and many out-of-state students. In 
addition, it spends an above-average 
amount of money per student. The 
interpretation of this factor is less 
manifest than was the case for the pre- 
ceding factors. As an aid in the inter- 
pretation, colleges with high scores on 
the Enterprising Orientation variable 
were identified. Two types of colleges 


appeared to predominate. First, small 
private colleges on the East coast with 
ing sales and re- 


many students study 

tailing, and, second, large public col- 
leges on the West coast with many 
students studying management. The 
trait common to these two kinds of col- 
leges appears to be an emphasis on pro- 
viding students with a business skill 
having immediate utilitarian value. 
This factor, therefore, might best be 
named Business Orientation. 

The comparison of the factors for 
this study with Astin’s (1962) factors 
for four-year colleges indicate some 
similarity, but in no case was the Co- 
efficient of Congruence high enough to 
justify consi rs identical. 


In the present study, of course, no at- 
tempt was made to use 

tion as a criterion in the rotation of the 
r college factors. On the other 
dent analytic rotation 
e stringent test of 
an is using one 
or the rotation 


analytic 
matches 
some cases 
Therefore, th 
that junior colleges 
four-year colleges; and 
‘opriate to apply ® classifi- 

cation SC ped for one type 
of college to 
The primary goal of this study was to 


provide brief pro 
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characterize junior colleges, 
Rd will make possible more effi- 
cient research on the effects of Junior 
colleges on their students. It seems 
clear that this goal was attained, for 
the original 36 scores were reduced to 
six factors which are reasonably clear 
and easily interpreted. The reduction 
to six representative factors provides 
a simple, economical set of items for 
assessing junior college environments 
in research on the ways in which dif- 
ferent colleges affect student accom- 
plishment and growth. We hope, how- 
ever, that this profile will be regarded 
only as a first step, that junior colleges 
will now receive the attention from re- 
searchers that is warranted by their 
importance to society, and that this 
attention will in turn lead to better 
descriptive schemes, better studies of 
junior college effects, and, ultimately, 
to better junior colleges, 


REFERENCES 

American College Testing Program. ACT’ 

Technical Report. Iowa City: Author, 
965. 


Astin, A. W. An empirical characterization 
of higher educational institutions. Journal 
of Educational Psychology, 1962, 53, 224— 
235. 


Astin, A. W, Further у; 
vironmental Assessment Technique. Jour- 
nal of Educational Psychology, 1963, 54, 
217-226. 

Astin, A. W. Recent studie: 
vironments. Paper read 
Personnel and Guidance Ass, 
neapolis, April, 1905. (a) 

Astin, A. W. Who goes where to college. Chi- 
cago: Science Research Associates, 1965. 

b 

dim А. W., & Horraxp, J. L. The En- 
vironmental Assessment Technique: A 
way to measure college environments. 
Journal of Educational Psychology, 1961, 

52, 308-316. 


alidation of the En- 


S of college en- 
at American 
Ociation, Min- 


J. M. Ricuarps, JR., L. M. Бахр, anp L. P. RAND 


BLockzn, C. E., Рисммен, R. H., & Rc 
ARDSON, R. C., Jn., T'he eo eee DN J: 
A social synthesis. Englewood Clifis, N 
ice- 1965. 
ae pw open door college. New 
York: McGraw-Hill, 1960. i sudor 
Gueazer, Е. J., Jn. (Ep.) woe ШУ ЕП 
colleges. Washington, D. C.: Am 
Council on Education, 1963. xus: 
Harris, С. W. Four models for Кысу 
ysis. Paper read at American Psyc! bes 
cal Association, Los Angeles, Septe 
964. ic 
ок, Н.Е. The application of giso d 
computers to factor analysis. EC 90, 
and Psychological Measurement, , 
141-151. ib 
Kaiser, H. F. Psychometric арргово? е 
factor analysis. In Proceedings prob- 
1964 invitational conference on tening MR 
lems. Princeton, New Jersey : Educa 
ing Service, 1965. И. куз 
дае The Varimax Criterion By, 
analytic rotation in factor 
Psychometrika, 1958, 23, 1877209, Factors 
Кхови, D. M., & Mzpsxzn, L. "studens 
affecting performance of {гала[ар Scale 
from. two- to four-year colleges. f Higher 
California: Center for the Study 0 
Education, 1964. ress 
Мерзкев, 1: L. T'he junior college: Profil, 
and prospects. New York: МеСга 
1960. 


$ h to 
Pace, C. R., & Srern, G. G. An appen 
the measurement of psychologica Journal 
teristies of college environments. 49, 209- 
of Educational Psychology, 1958, 47+ 
277. f college 
Ricnanps, J. M., Jr. Life goals 0 Psi 
freshmen. Journal of Counseling 
chology, 1966, 13, 12-20. tic study 
Ricuarps, J. M., Jn. A factor analy eshmen- 
of the self-ratings of college Missi" 
Educational and Psychological | 
ment, in press. : college 
Sanrorp, N. (Ep.) The American А 
New York: Wile , 1962. is 0 
UCKER, L. R. E icd of synth ch 
factor analysis studies. Personnel pingto?’ 
Section Report, No. 984. Was 
Dept. of the Army, 1951. 


(Received June 21, 1965) 


Journal о, 
of Ed 
1966, Vol. 57, азала Fauchology 


COMBINING RESPONSE MODES IN PROGRAMMED 
INSTRUCTION’ 


JOANNA P. 
University of 


A group of 120 6th-grade pupils wo: 


@ 1 versions: standard constructed 
ultiple-choice format (B); а combination 


8 
tructed-response mode was used 


terminology as responses and the multipl 
was required (C); 


where general vocabulary 


{йай in which response-mode was uncorrelate 
oups A and C were significantly bette 


constructed-response posttest and 


ie that the use of a varied format may not 
mance. Rather, the important variable may 


WILLIAMS 
Pennsylvania 


rked through a linear program inl 
-response format (A); standard 
format in which the con- 
on frames that required technical 
e-choice mode on items 
and a combination 
d with item-type (D). 
r than Groups B and D on à 
in scores. The findings sug- 
in itself facilitate per- 
be the correlation be- 


on ga 


tween training mode and type of item. 


Ani > 
Son Кра method is evaluated 
First, 8. io two general criteria. 
must, bon effectiveness of the method 
itis sup е, Does it teach what 
Way, q osed to teach? Or, put another 
terion E es performance on the cri- 
cond = reach the desired level? 
must be he efficiency of the method 
Methods gon into account, for 
amounts of at require unduly large 
onstru training are not practical. 
ii Ws ее on nm (CR) train- 
ear ch the student is required to 
Compare ge ене to every frame, has 
With al] favorably in effectiveness 
Rated eng modes typically investi- 
& We» Holland, 19605 Krumboltz 
n Owever СД ‚1962; Williams, 1963). 
Mefficient: this mode is relatively 
ount of Less is learned in а given 
Tàinin time than with any other 
1965. mode (Goldbeck & Campbell, 
me villiams, 1963). 
Niger, tible that determines the 
effectiveness of different train- 


Ing, 
Wri 


? This 
Dope th was supported in part by 
es Ome Research Program, United 
€ ашар 0 Education (Project 42100). 
Bragg ыт 10 indebted to Pilen I. Levy for 
S daa in the collection and analysis 
Shica and to Coronet Instructional 
uce ROI Tllinois, for permission to 
Toupin a portion of their program 
E Animals: What is a Mammal?". 
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S 


odes is the type of item used on 
terion test. CR training, for 
r to multiple-choice 
(MO) training on test items that use 
novel or technical terminology, but 
not on items that use general, familiar 
vocabulary. This has been demon- 
strated with both new and review 
material, and in both elementary 
pupils and college students (Williams, 
1963, 1965). As usual, however, 

training required more time, and the 
overall efficiency of that mode was 


lower. 

These findings SU: 
which it might be p 
the efficiency of a progra 
diminishing its effectiveness: ( 
training on frames requiring technical 
terminology and MC training оп the 
other items. How would such à pro- 
gram compare in (a) effectiveness and 
efficiency with a program com- 

R frames and one 
he present 


es programs of these 


ing m 
the cri 
example, is superio 


ggest one Way in 


ossible to increase 
m without 


Use CR 


which is en 
study compar 
evaluate such a com- 
erly, one 


the concurre 
modes. Since 
a mixture of tr 


5 
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known (Taber, Glaser, & Schaefer, 
1965), a fourth program was designed, 
which also used the same two training 
modes, but in which there was no 
correlation between training mode and 
type of item. That is, frames requiring 
technical responses and those requiring 
general responses were assigned ran- 
domly to CR presentation or to MC 
presentation. 


Метнор 


Subjects 


Subjeets (Ss) were 120 sixth graders who 
had not been previously exposed in school 
to the material covered in the program. 
An additional 14 Ss were dropped: 11 for 
failure to complete all training and testing 
sessions, and 3 to equate pretest scores and 
Ns in the experimental groups. 


Materials 
The first 120 frames of 


gram “Grouping Animals: 
mal?” 


the Coronet Pro- 
What is а Mam- 
n booklet form, 
mes, 44 requiring 
response 


page. The S was requi 
answer for every frame 
sheet. In Group B, all 
the MC format. Unde 
in the same way as in 
tive responses were printed. The S Was re- 
quired to underline hi 

the correct response 
In Group C, those 44 
technical terminolog 
presented in the CR format; the other 44 


abulary, 
n Group 
from the 


were presented in the MC mode. I 
D, 44 frames, randomly selected 
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88, were presented in the CR format, and 
the other 44 appeared as MC items. Р 
All Ss recorded their responses on 3 
separate answer sheet, which contame 
blanks for the CR items and the two alterna- 
tive choices for the MC items. The Meum 
that were presented as complete statemen 
and required no response were indicated by 
a row of X's by the appropriate number. ms 

An objective test was made up of 16 Е 
that required constructed responses. A 
the items required S to utilize yw Ha 
terms introduced in the program (suc half 
vertebrae and phylum), and the other ith 
were items to which S could respond a à 
familiar words in his standard vocabu m 
An additional 16 items, which required | 
choice between two alternative eee 
were also given; half demanded ei 
terms as responses, and half were © ТВ? 
familiar vocabulary type. These were ann 
lyzed as two separate tests, but EF a i 
presented to the subject as one GS 
which items of both kinds—CR and } 
were mixed together. 

Items were balanced so that every 
nical term which was used as a resp M 
appeared both on the CR and on. Me into 
tests. In addition, the test was divide com- 
two parts, each of which had to be each 
pleted independently. In each һай, ог 
response term appeared either аз ап 
as a CR item, so that there coul 
cueing from one type to another. 
Ss were given Part 1 first and then 
for the other Ss, the order was reverse 


tech- 
onse 


Procedure 


of 
The Ss were randomly assigned 10 отет 
the four training conditions. All 8% an 
given the objective test as a pretest, riy. 
Sixteen minutes were allowed (eight cce?" 
utes for each half). On each of two ө вуз 
sive days of training, starting three р 60 
after the pretest, Ss worked iio by 
frames of the program. The time pron 
each S to work through each day rollos i 
ment was recorded. On the day їо 977, 
the completion of the program, Sh 
again given the objective test W! 
Same time limits as in the pretest- 
finished the test in the time allowed. 
efore the experiment began; jn pre” 
told that this was an experiment 17 dass 
grammed instruction and that thei (5p. 
had been specially chosen for partici score? 
t was pointed out that their test sc oo! 
would not count as part of their ® as. 
grades, but that they should try t° 
well as possible. 


were 


Response MODES IN PROGRAMMED INSTRUCTION 


е TABLE 1 
MEA) 
N Numper Correct on CowsTRUCTED-RESPONSE TEST 


Pretest Posttest 


Training mode uh 
ech- |General | Ovi Tech- 
Rin | items ФИ шр | шшщ; саа Оза вр 
А (Allo — 
B (An een response) 0.73 | 2.23 | 2.96 | 2.12 | 3.30 | 4.70 | 8.00 | 3.90 
C (Оокат Б oice) 0`60 | 1.97 | 2.87 | 2.03 | 2.07 | 4.03 | 6.10 | 3-21 
Coe 0.50 | 2.17 | 2:67 | 1.88 | 2.87 | 4-13 | 7.00 | 3.72 
, uncorrelated) 0.57 | 2.10 | 2.67 | 1.65 | 2.40 | 3.70 6.10 | 3.05 


RESULTS 


N 
ee and standard deviations of 
Шоп ier oe measures on the CR 
oth E are presented in Table 1. 
normall E and posttest scores were 
significan ; istributed. There was а 
Posttest gru from pretest to 
6.06, af L he /s ranged from 4.97 to 
Contro] gx 58, p « .01). An untrained 
sary si p was considered unneces- 
1965) Sedo previous work (Williams, 
Pretest, icated no difference between 
CHR tara posttest following similar 
EE d also presents the mean 
MN correct responses On the 
training mod test as a function of 
di, mode and type of test item. 
here onten by analysis of variance, 
trainin ere no differences among the 
(P< D je groups on the pretest 
items г, cores on general vocabulary 
techni were higher than those for 
/ i items (F = 105.34, df = 
Significant < .001). There were no 
Е чет interactions (F < 1). On 
test test, as on the pretest, type 
showed item (general > technical) 
ee "Up ge effect (F = 84.63, 
the difta 16, p < .001). In addition, 
reached rences due to training mode 
(Pl А ЛО level of significance 
Variables did df = 3/116). The two 
[ы 3/116). not interact (F = 1-03; 
fade ake priori predictions had been 
about the training mode varia- 


ble, specific comparisons Were made. 
They indicated that Group A (all 
CR) was not significantly different 
from Group C (combined modes, 
correlated with item type) (F = 1.23, 
df = 1/116), put that Groups A and C 
together were significantly higher than 
Groups B (all MC) and D (combined 
modes but uncorrelated with item 
type) (F — 4.71, df = 1/116, P < .05). 
The latter two groups did not differ 
(Е < 1). 

The difference b 
posttest scores Was 
of the amount gained from the ex- 


perimental training. 
R test show results very 


etween pretest and 


CR posttest. Again, 
reached the . 1 
(F = 2.48, df = 3/1 
of item were not sii ificantly different 
(F «n and there w 
59, df = 3/116). 
arisons among these 
ed the same pattern 
as did those on the posttest. Group A 
i “cher than Group С 

= 1/116). Groups A 
antly higher than 

mnd DE = 6.33, df = 
< .05). The latter two groups 
differ (F < D- 

Means and standard deviations of 
the amount of time taken in training 
are presented in Table 2. There were no 


did not 
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TABLE 2 
TRAINING TIME (IN MIN.) 
Training mode M SD 
7.5 
itructed response) 39.4 
0 агане | ma | ga 
i ITI B 
5 (Combined: Cnoorrelated) 40.2 9:2 


differences among the experimental 
groups (F = 1.82, df = 3/116). 

An efficiency score was calculated 
for each subject by dividing his post- 
test score by his training time, giving a 


& Camp- 
differences 


also failed to show differences (P = 
1.60, df = 3/116). 

There was a significant improvement 
from pretest to posttest on the MC 
criterion test (p < .01 for each group), 

ut, as in a previous study (Williams, 
1965), this test was not Sensitive to 
ifferences in training mode F< 1, 


df = 3/1 16, on both posttest and gain 
Scores), 


Discussion 


The results were as expected in 
terms of the effectiveness criterion, 
that is, achievement Scores. The su. 
perior posttest Scores and gain scores 
of the CR training group (A) as com- 
pared with the MC training group (B), 
previously demonstrated by Fry ( 1960) 
and Williams (1963, 1965) on CR 
tests, also were found in the present 
study, where straight reading frames 
were incorporated into the programs, 
Since this comparison reached sig. 
nificance, the present experimental 
Situation was considered Suitable for 
assessing the relative effectiveness of 
the combined modes, 

The important comparison is that 


WILLIAMS 


between Groups C and B. As predicted, 
the program that combined CR bacs " 
ing on technical items and MC traini E 
on other items (Group C) was nid 
effective than the wholly MC prog: e 
(Group B), indicating that a combin: 2 
mode сап indeed boost qp 
Moreover, it is clear that it was "i 
correlation between CR training e 
technical terminology that — ndi 
for Group C's superiority, an hich 
the mixed format. Group D, W al 
contained the same mixed format but 
the same amount of CR training was 
Uncorrelated with item type, din 
significantly below Group C, an 
fact was no better than Group B. been 
Togrammed instruction sert bé 
criticized on the grounds that | ting 
comes monotonous and € mé 
(e.g., Fitzgerald, 1962). It is pe 
that the routine, highly limited "king 
tends to restriet independent s 4007 
(Doty & Doty, 1964), and such sug- 
cerns have led to the reasonable efer, 
gestion (Taber, Glaser, & Бона in 
1965) that there be some variatio 
the format of the program. „айй 
e present Да. suggest, how tion 
that merely introducing a put » 
of modes into a program on а таоме" 
basis does not enhance the d - 
ness of the program. Unless а арі" 
format program is designed о ction 
talize on the favorable S an 
between the Specific training mo! 
the type of item (in this wc it 
mode and technical items), t single 
might be as effective to use а ds of 
training mode, Other metho ram, 
incorporating variety into a pr OF cake 
OWever, such as interspersing forma 
type” frames that have no 4, self- 
confirmation (Hershberger’s, 196 might 
evaluational response items), 
Prove useful. svo with 
The data are not png оз. 
Tespect to the efficiency cri ere n? 
Contrary to expectation, there W 
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Significant differences among the 
A es the amount of time taken 
Gor : ough the program. The addi- 
P о the large number of reading 

mes, which all groups could go 
Me i» ts a rapid pace, probably 
M ап overall training time differ- 
did & : herefore the present conditions 
in wi ord an adequate situation 
um Би test the hypothesis that а 
к : mode where mode is cor- 
Miu rh item type would be more 
oS than an all-constructed-re- 

program. 
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LEARNING AND MEASURED ABILITIES 


JAMES P. DUNCANSON 
Princeton University 


The interrelationships among measures of learning and measures of 
abilities were investigated by administering a battery of ability tests 
and learning tasks to 102 6th-grade children. The learning tasks in- 


on 
tradiction depends first of all E 
adequate measures of ciam e o 
Second upon the Scope and n the 
the learning situations and о 
abilities measured. which 


Measures of learning ability f 
that we find statements to the effect have been used include final Terie. 
that learning ability and intelligence attainment, gain (the difference 


" Pe al levels), 80 
McGeoch & Trion, 1952. Т. a ait enl fugi 6 


J 
; Tyler, 1956 parameters of learning curves. 55 
Woodrow, 1946). In addition, learning of these is subject to more ОГ ect 
Ш one situation appeared to be un- Y: 


serious objections, including the ef- 
of different initia] levels, ce тегеп 
fects, unequal difficulty e п the 
l TY- stages of learnin ‚ the choice о. 
day Observation as Well as the con. functions to imei the Jon" eer 
Process, and the number of param 


related to learning in 
(Guilford, 1940; H 


tween intelligence and scholastic i : 

5 [ > . { 
achievement and the established use. de by the d studies 
fulness of ability testing for Selection ters ghee d ip b 


Га i have investi lationship. 4 

and prediction. Resolution of this con- Publ, mi p peaa ab t 
using parameters of fitted ho on 
1 This report is based on а dissertation CUrves. Both Stake (1961) an йе 


submitted to Princeton University in par. (1960) found in factor-analytic 8 


tial fulfillment of the Tequirements for the that par learning б 
degree of Doctor of Philosophy. The author ee Ai P 


‹ У m achieve je 
is indebted to his dissertation committee, eto ability end add 


ere 
particularly Harold Gulliksen, to Miss Ruth measures and that there w ore than 
E. Parker and other school officials, and to tional factors common to m gener 
Educational Testing Service. Financia] One learning situation, but no 


Support was received from Office of Naval factor common to all. tbe 


that , 

Н cei А , Games ( 1962) also found jat? 
Gulliksen, Principal Investigator, and Offi A 5 ired-assot 

of Naval Research Contract Nonr 2214(90), learning of serial and paired 


eas 
John W. French, Principal Investigator, verbal tasks was related to dition” 
* The author is now at Bell Telephone of Tote and span memory. pete? 
Laboratories. 


evidence of a relationship 
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1 . 
tribute] and measured abilities is con- 
ены а area of motor tasks by 
an d Fleshman cot (1954, 1955) 
15 apparent, then, that abilities 
ивы to performance in а 
Cie чеш 'The principal dif- 
Ко etween the present investiga- 
Nera previous ones is an attempt 
Pi А е more adequate measures of 
айса" performance and to system- 
ing py a small variety of learn- 
арор в basic question is the re- 
and She etween measures of ability 
небы UP of learning and between 
Shin ща in one task with 


Метнор 


Learning Tasks 


Th 
ty Bets types of learning tasks and three 
9f nine enia were combined for a total 
Ormation 5, The types of task were concept- 
ory; the ү) Paired-associates, and rote-mem- 
Merica] ypes of material were verbal, nu- 
жы апа figural. 

Adapteg ConcePt-formation tasks жеге 
task qi from the Wisconsin Card Sorting 
Series of E 1948). Each task consisted of a 
ages bei imuli printed one to a page, the 
Stimulus E bound into а booklet. Each 
In three ies four attributes which varied 
“eDt-form, eps. For example the figural con- 
three bon task stimuli had one, two, or 
th linm ers; one, two, or three figures; 
8 were either circles, squares, or 
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(left) and negative 


Fic. 1. Positive 
(right) instances of the concept "shaded" 
taken from the figural concept-formation 
task. (The correct response was printed on 
the back of each page behind the cross- 


hatched area.) 


values of one of the attributes was chosen as 
defining positive instances of the concept. 
For the example given, the concept was 
*tshadedness"'; that is, all stimuli having a 
shaded figure or figures, whether they were 
circles, squares, ог triangles and regardless 
of the number of borders, were positive in- 
stances of the concept. 

The attributes and steps of the three con- 
cept-formation tasks are presented in Table 
1. Figure 1 presents а positive and a nega- 
tive instance of the concept "shaded." 
Twenty-five of the possible 27 positive in- 
stances and 25 negative instances were 
chosen and à block of 50 pages prepared. 
The sequence was random subject to the 
following restrictions: (а) the first stimulus 
was a positive instance, 0) there were five 
positive and five negative instances 1n each 
block of 10, and (c) no more than three posi- 
tive or three negative instances, occurred 
consecutively. The block of 50 stimuli was 

he figural task and three 


trian 
Bles; А 
Shadeq sd And. the figures were either white, repeated twice for t s 
› Or black. For each task one of the times for the verbal and numerical tasks to 
TABLE 1 


ATRI 
BU 
TES AND STEPS USED IN CONSTRUCTING CONCEPT: 


„FORMATION LEARNING TASKS 


Concept forma 


Ci 


Анны 
ute. 
Nu 2 Steps Attributes 
mber Á- 
ber Кә Number of bor- 
Poor ee Type of border 
worda 
Noun, ^, adj B 
Ype of vat ive verb^, adjec- Number of num: 
АШ capitals, initial | Number of digits| 
letters capitalized, | рег number 


lower-case let-| 


tion—numerical Concept formation—figural 


m 
Steps Attributes Steps 
M ere 
Number of bor- | 1, 2, 8 
523 ders : 
Solid, sides only, Number of fig- | 1, 2, 3 
ures 
1 [^3 ge Y | туре of figure Gua i square, 
, riangle 
Brightness of fig-| White, shnded?, 
1,2,3 ure? black ^ а, 


a 
The 
parti, 
cular aspects defining positive instances of each concept. 
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James P. DUNCANSON 
TABLE 2 
Parrs Usep IN Parrep-Assocrares LEARNING TASKS 
" " 1 
Hidroist Paired associates—numerical? Paired associates— figural 
" nse 
Stimulus Response Stimulus Response Stimulus Respo 
Yum jewel POLEF 63 1 B 
TIS dinner GOKEM 95 2 n 
299 money LATUK 57 3 x 
KEM village ZUMAP 89 4 "i 
soz insect TAROP 31 5 
NOL wagon BALAP 18 6 x 
RUH office KUPOD 76 7 e 
BEK kitchen MEDON 42 8 A 
9 ii 
Example pairs 
JAX uncle NARES 71 A / 
ZAD leader CAROM 47 B A 
wig zero FIMUR 90 С 


* Pairs selected from list used by Martin and 
b logs selected from Noble (1953) ; numbe: 


Para: 


answer printed 
On the last page of the booklet he was asked 
to write the basis for his responses, to ver- 
balize the concept, if he could, | 

The instructions to the subjects stated 
that they would be guessing at first, but 
that as they went along, they should be able 
to get more and more correct, answers, and 
that they were to try to get as many correct 
as possible. A subject’s learning performance 
was scored in terms of the number of cor. 
rect responses for successive blocks of 10 
stimuli. 

Table 2 presents the 
paired-associate learning tasks. АП three 
tasks were presented in modified teaching 
machines using the anticipation method. 
Thus the subjeet turned the knob of the 
machine until the stimulus member of a 
pair appeared in the opening of the ma. 
chine. If he knew the response member of 
the pair, he wrote it on the paper, then ad- 
vanced the paper so that the stimulus and 
the subject’s response moved under a plastic 


pairs used in the 


Saltz (1963). 
rs from random number table. 
red 

window and the correct response 1- Y h 
in the opening. The experimeter 8 to the 
pace, telling the subjects when to ыд ол 
Stimulus, etc. Approximately three ibers of 
were allowed for the stimulus ipm four 
the numerical and figural tasks o pro" 
seconds for the verbal task in order t their 
vide time for the subjects to aed were 
responses. Approximately two secon d 
allowed for the response members ted 15 
pairs. Each set of pairs was presen with 
times in different, randomized orden d i 
approximately 10 seconds between f 
tions. The subject’s score was the pus 
correct anticipations on each rep 
after the first presentation. jes 

The lists Кя in the verbal and num che 
Tote-memory tasks are given in Та in the 
ists were each presented 12 ише опё 
Same order, at a rate of approxim& s, with 
word or number every two secon resent” 
Approximately 10 seconds between P ed by E 
tions. Each presentation was one? 
free-recall test. A subject’s score Пей 
number of words or numbers reca d 
rectly on each test, without regar! 

ОЁ recall, k, 16 nor. 
In the figural rote-memory tas tudy 
sense figures were presented for ihe stunt 

Seconds. Immediately following rou bi 
Was а recognition test in which 16 А „э 
five nonsense figures were presen 


ition 


cot” 
rde" 
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group с ini 
А “oe ate one of the original figures. 
rt deed were repeated 12 times 
pud. Ject’s score was ihe number of 
rrectly identified on each test. 


Ability Tests 


The abiliti 
жеге йе selected for measurement 
ment of oz on the basis of 2 priori judg- 
Suitable ( evance and the availability of 
Kit of Renee, The tests, taken from the 
y French спао Tests for Cognitive Factors 
Ma: ARS strom, and Price (1963), were: 
Mi: Ms ciative Memory (3 subtests) 
М. mory Span (2 subtests) 


DN Shell 
р. umber Facility (3 subtests) 


R: G 
ү: епега1 Reasoning (2 subtests) 


Alsoi 
z. ane 
luded as measures of ability were 


: Paragraph Meanin 
А gra а Б 
: Word Meaning 


Spelling 

SABAR. anguage 

BBa Arithmetic Reasoning 
Arithmetic Computation 


The enti 
e " 
five ntire sixth grade (135 students in 


ldren 
fom ee analyses are based upon 


Procedure 


The 1 а 

dministerad te tasks and ability tests were 
iately one} in eight sessions, each approxi- 
s the moe long. The sessions occurred 
tpcutive pee and afternoons of four con- 
bs regular che They were administered in 
t © student, assrooms by psychology gradu- 
ants withi 5. The distribution of tasks and 
tly MAE doen was determined ran- 
tasks of the e restriction that no learning 
we of the same type or material and no 
ta’ Same s same ability were included in 
9 this Sul tod There was one exception 
f e preree the two memory-span subtests 
Ve tape s den and, to avoid the use of 
tered ee on two days, were ad- 
е Session. the first and last tests of the 

Scori 
ees the learning tasks, liberal 
re used with regard to spelling. 


Minis 


n 
tanga. 
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TABLE 3 
Lists or Worps AND NUMBERS USED IN 
VERBAL AND NUMERICAL КОТЕ Memory 
LEARNING Tasks 


Rote memory—verbal* Rote memory— 
numerica. 
| 
MODERN 83 
PRETTY 60 
GOLDEN 94 
FAMOUS 97 
GENTLE 13 
BETTER 48 
COMMON 92 
SIMPLE 78 
HAPPY 56 
NARROW 52 
HEAVY 43 
LITTLE 36 


» AA, 2-syllable adjectives from Thorn- 


dike & Lorge (1944). 


b From random number table. 


in all cases where appropriate, 
were told that there was no 
penalty for guessing, that if they thought 
they knew an answer but were not sure to 


go ahead and mark it. 


In addition, 
the students 


Analysis 
Each learning task was seored in terms 
of the number of correct responses 1n suc- 
cessive blocks or trials. The Subjects X 
ask was analyzed 


Trials matrix for each ti 
separately according to à method developed 
960). The characteristic roots 


by Tucker (1 t 
and vectors of the crossproduct matrix 
formed from the raw score matrix were 
found and а decision made a5 to how many 
were significant. 


The factor scores or coefficients of the 


subjects on these unrotated factors were 
then calculated. The scores obtained in this 
way indica tribution of each com- 
ponent to ea ^s performance jn that 


particular learning In other 

words, the first step determ 

factors are involved in learning 2 | 

the second step determines the importance 

or weight of each for a given individual. 

The scores for the individual were assumed 

to describe his learning performance. 
scores on all of the learn- 


The individual's 
ing tasks were then combined with his 
achievement and ability scores, and the re- 
sulting correlation matrix was factor ana- 
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lyzed. Seven factors were extracted, and 
rotated to an equamax solution. 


RESULTS 


Learning Tasks 


igure 2 presents the learning curves 
Es iim the 102 subjects for 
each task. Learning was not as com- 
plete as desirable for the concept. 
formation tasks, and the number of 
correct verbalizations of the concepts 
was correspondingly small, being 20, 
41, and 26 for the verbal, numerical, 
and figural tasks respectively. Learn- 
ing in the other tasks appears satis- 
factory. 

An indication of the reliability of the 
factor scores on the learning tasks is 
provided by the final communality 
estimates determined by an iterative, 
refactoring procedure, esti- 


either lower 
specificity of th 


Ability Tests 


The ability tests were 
the subjects, the sixth gr 
lower limit of the range recommended 
for the tests. Corrected Split-half relj- 
abilities and final communality esti. 
mates for the ability measures are pre- 
sented in Table 5. 


difficult for 
ade being the 


Learning and Reference Factors 


The loadings of the lear 
components and the ability 
on each of the seven factors r 
an equamax solution 
Table 6. The factors 
as follows: 


ning-task 
measures 
otated to 
are presented in 
were interpreted 
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Factor T: Verbal ability. The first fao- 
tor is interpreted as a pulum d 
factor, and it is common to bot Lo 
learning tasks and the ability m 
ures. 

Р actor I1: Reasoning ability. Аш 
the reasoning tests have their poe al 
loadings on this factor, and the Pom 
paired - associates and rote - me The 
tasks have high loadings on са 
verbal and numerical tasks do no ‘ele- 

Factor ITI: Speed. The common ©, 
ment of the variables loading 0D olve 
factor seems to be that they poe 
simple operations which almost € oug 
one can do correctly, given Wo nee 
time. Accordingly, it is called d in 
factor. It is apparently not us le 
describing performance in the 
ing tasks. а js 

bees IV: Rote-memory ability T 
factor is involved in the three 1 of 
ative-memory subtests and in emory 
the paired-associate and ee mem 
learning tasks except figural ro a rote 
ory. It is interpreted to <i oth the 
memory factor common to 20° gs 5 
ability measures and the learning pis 

Factor V: Nonverbal ged rote 
factor is primarily defined up 
memory tasks involving e a 
bers or figures. It is interpre distin? 
nonverbal rote-learning factor 
from the ability measures. _ n. All of 

Factor VI: Concept for noeud op 
the concept-formation tasks and 2° 
this factor and on no other ‘ple osd; 
other variable has an apprecia efin? 
ing on it. Concept formation с the 
by the tasks used is not ^ e othe 
abilities measured or to 
learning tasks. m. 4. 

Factor VII: Verbal loarn orn 
factor is defined primarily M EU 2 
tasks involving verbal mate пй fact’ 
interpreted as a verbal iy epit. ure 
independent of the abilities 


arD- 


82% 


LN 
8 


Mean % Correct Re 


6 9 ] 


Blocks of IO responses 
Concept Formation — Verbol 


ses 
o 


~ 
Meon % Correct Respon: 


Mean % Correct Respon 
m 
o 


o 


3 6 9 [3 


Blocks of 10 responses 
Concept Formation — Numerical 


a 


2 4 6 8 10 


Blocks of IO Responses 
Concept Formation- Figural 


o 


Mean % Correct Responses 
m 


зр 


Mean * Correct Re 
S 8 a 


о, 


ses 
8 


8 o 


N 
o 


о 


Mean % Correct Неон: 
pae e 


o 


4 6 8 10 


Trials 
Paired Associates — Verbal 


4 6 8 10 
Trials 


Poired Associotes— Numericol 


2 4 6 8 10 


Trials 
Poired Associates - Figural 


12 


$ 


N 


Mean % Correct Response, 
х 8 


o 
m 


4 6 8 10 12 


Trials 
Rote Memory- Verbal 


75 


Meon % Correct Responses 
8 


25| 
о 2 4 6 8 [*] 12 
Trials 
Rote Memory —Numericol 

$ 
299 
S 
č 75 
5 
$ 50 
5 
о 
зе 25 
c 
g 
z0 2 4 6 8 lo 12 


Trials 
Rote Memory - Figural 


Fria. 2. Trial means expressed as percent of total possible correct responses for each of the nine learning tasks. 
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TABLE 4 


FINAL CowMuNALITY ESTIMATES FOR 
LEARNING VARIABLES 


Variable Commünality 
CF-V-1 34 
CF-V-2 п 
CF-V-3 19 
CF-V-4 .01 
CF-N-1 -48 
CF-N-2 -12 
CF-N-3 16 
СЕ-Е-1 43 
СЕ-Е-2 -04 
PA-V-1 .95 
PA-V-2 .22 
PA-V-3 .25 
PA-N-1 .60 
PA-N-2 .33 
PA-N-3 .07 
PA-F-1 :46 
PA-F-2 43 
PA-F-3 «17 
RM-V-1 .57 
RM-V-2 30 
RM-V-3 18 
RM-N-1 5l 
RM-N-2 198 
RM-F-1 .60 
RM-F-2 .37 
RM-F-3 .37 


. Note—The numbers following the learn- 
ing task designation j 


ndicate the Component 
curves for that task. For example, CF-\-] 
is the first component curve of the verbal 
concept-formation task 


Discussion 


In a factor-analytic Study, 


" : only 
those dimensions 


will be revealed 
along which there is variation in both 


the variables and the Subjects, The 
generality of the findings is thus re. 
stricted by the sampling of Measures 
and people. In this study the sample 
of subjects is assumed to be representa- 
tive of the population of sixth-grade 
children from urban public schools. 
The sampling of learning tasks and 
ability measures was, on the other 


hand, intentionally restricted. It is 
possible, howev: 


er, to draw some inter- 
esting conclusi 


ons regarding the rela- 
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А T 
tionships between learning and Fu 
ties and learning in different situa BE 
In the first place, learning is re ium 
to measured abilities. With the pn 
tion of the st adeps pma e E 
every learning task was relate a 
or more of the abilities qe » 
Furthermore, the abilities pipe 
general appropriate to the ai es 

classifieation of the tasks. n t 
cifically, tasks involving wor Ar 
loadings on verbal алатау E 2 dt 
volving numbers and figures x goes 
nonverbal factors; and paire ry j 
ate and rote-memory tasks ha : 
ing: memory factors. | ái 

The не that learning 8 
TABLE 5 ou 

RELIABILITIES AND COMMUNALITIES P 

REFERENCE VARIABLES 


pality 
Variable Reliability | Comm" 
UU uu e ‚70 
IQ .87^ .82 
SAB-PM 90" 8 
SAB-WM “928 ‘67 
SAB-SP 93" "5B 
SAB-LA 81° `68 
SAB-AR .89" P 
SAB-AC 87" 42 
Ма-1 .76 “46 
Ma-2 .79 ‚78 
Ma-3 .81 .31 
Ms-1 58 57 
Ms-3 .68 й 
NA 91 ‘64 
N-2 .92 13 
N-3 .90 29 
PA .81 42 
P-2 .79 49 

P-3 .88 

‚50 
RA .44 43 
R4 .44 58 
VA .62 42 

V-2 .68 cake 

кеа " go 

* Reliability coefficients Pofficient? „ед 

Tom the test manual. Co nts cor row? 
marked are split-half coefficient? |) op 


гта! 
for full length with the Spe? 
formula, 


RorarED Factor Matrix 


TABLE 6 


Variable Factor 
I п ш IV M VI VII 
IQ 
SAB- 47 52a 40 06 15 15 03 
SAB м 69^ 48 28 06 —06 08 13 
SAB.sp 77 37 17 03 —07 00 13 
SAB- 55s 24 39 25 20 E 25 
e * 07 08 10 
SAB 69 22 15 13 
“aR 42 59" 34 12 07 04 12 
S. 
ae 28 51° 40 -1 16 22 09 
M 09 10 18 54^ 22 08 17 
ж 19 0 m 04 13 
Ma. 17 04 6 
a 1^ 27 —08 44 
Ms. 33 14 27 5 
E a —12 24 18 03 
Ms.3 44 06 —03 12 18 Ё - 
63" —25 05 09 22 
N- 
N i 26 23 77° —01 1 04 _% 
22 32 06 03 
N.3 15 32 59^ 2 = 
15 06 06 
Py 19 03 80° 15 15 P. z% 
Pg 01 19 30^ 16 © 09 
P.3 10 06 56° 16 » T E- 
10 46^ 4l 20 
Ra —01 23 
R4 34 55^ 12 —05 z zn =ч 
Va 29 48^ 24 17 a 3 18 
уз 63^ 26 17 03 am 07 E 
Ср.у р 55^ 21 3 o 15 fos 14 
Cr. 13 05 = - 12 15 
TS 09 11 21° —08 07 
CR-V. Е 25 -14 
crys 17 —01 —21 16 e B Zos 
Cry 04 —02 —03 16 Е. ven ш 
СЕЛО —07 14 —04 Ж 06 —95^ —06 
СЕ. з —07 —10 —18 = E 16 19 
Cr = 08 —03 —03 07 58° 17 
—06 —23 03 06 к 
Crp. E 
PAYG E 04 188 01 Y ж 34 
PA-V. 40^ 25 22 30 —09 —93 —31^ 
PAV. -% | —02 @ | =й | =} a 18 
PANY —25 17 | -3 | -2 в a 48 
PAN —03 24 01 39 a —11 14 
Я 09 10 —04 —59^ M 
PAN —05 —25 
ag 02 
PA? 02 07 05 E A 22 17 
PAD —04 33 18 20 13 —08 
Pap —15 27 —06 51° ET —16 —21 
Ruy, 03 14 =18 | = ar 07 49^ 
RM 39 13 34 18 | $9 _07 10 
R e: 04 10 16 49 и 
—09 —86° 
ША. - —01 18 "a 03 08 
М-М 04 02 1 57 8 
RMN —02 16 21 25 42^ 03 —15 
RM.p —06 —04 08 ai 44 05 -03 
RM. S 03 64а 03 01 Sa 09 07 
DUE 00 55^ 03 15 57° 01 10 
—06 08 02 = 
N - 
a Tie; Decimals are omitted. 


highest loading for a variable. 
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of abilities are related is not 
mue ene been reached both in the 
field of motor tasks (Adams, 1957; 
Fleishman, 1960; Fleishman & Hem- 
pel, 1954, 1955) and in the field of cog- 
nitive tasks (Allison, 1960; Games, 
1962; Stake, 1961). 'The present re- 
sults are corroborative of these previ- 
ous findings and extend their generality 
to learning tasks more representative 
of those found in other psychological 
literature. 

The second conclusion is that there 
are learning factors which are inde- 
pendent of the abilities measured. 
Three such factors were found: The 
concept-formation factor and the 
verbal and nonverbal learning factors. 


Both Stake (1961) and Allison (1960) 
also found learning factors, but in 
neither case did the factors distinguish 
between verbal and nonverbal tasks. 
Games (1962) also found two or pos- 
sibly three remaining factors after 
rote- and Span-memory factors had 
been partialed o 


ut of the learning per- 
formances, but he did not interpret 
them as learning factors. 


That the distinction between verbal 
and nonverbal learning factors is 
neither obvious nor trivial is indicated 
both by its failure to appear in the 
studies cited and by the distinctions 
which were possible but which did not 
occur in the present study. For exam- 
ple, the paired-associate and the rote- 
memory tasks did not define separate 
factors; nor did the numerical and 
figural materials separate, 

It is possible that the learning fac. 
tors represent abilities which were not 
included in these measures, but be. 
cause of the regularity with which fac- 
tors additional to ability faetors seem 
to be found, it seems unlikely that 
adding new ability measures will 
eliminate them. Rather it seems that 
there are processes involved in active, 
repetitive learning situations which are 


James P. Duncanson 


independent of the abilities gn 
be measured in a static testing Si p 
tion. Interest and motivation nage d 
contributing factors. Сш. ‘eat 
Champion (1962) have oe loi 
anxiety (as measured by the ip 
Manifest Anxiety Scale) affects 
limit of learning. ND 
The third V conclusion 18 he 
learning in one situation is ws d 
learning in others. In other uer B 
while there does not appear to баг 
general learning factor, it does Н ord 
that there are fewer learning am 
than learning tasks. This ище у 
relationship exists both in pere s 
and learning factors. Each tas 5 but 
on a unique combination of d 2 
the factors are common to other t: n is 
The fourth and final iure ee 
that the concept-formation task i3 
are unrelated to the other Le E. not 
and to the abilities measured. e possi- 
clear why this should be so. On oncept 
bility is that the tasks called сойо, 
formation are rather specialize whi 
requiring fairly specific abilities 
may or may not be the same 48 or in 
required in the usual scholastl¢ % oy. 
telleetual tasks thought of ed the 
cept formation.” That is, whet in the 
lack of interrelationships hes e spe 
battery of ability tests or in г both 
cially devised learning tasks © 
Temains at present a puzzle. 
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ABILITY TO MATCH INFORMATION WITHIN AND 


AL SENSE 

ETWEEN AUDITORY AND VISU 

ш MODALITIES AND SUBSEQUENT 
READING ACHIEVEMENT" 


SIEGMAR MUEHL SHIRLEY veas dm 
e tal Health Insti 
University of Iowa poc joe 


AND 


ti bined with reading-readiness subtests only 
letter naming made a significant contribution. The relation between 
A-V matching and letter 


cussed. 


Birch and Belmont (1964) have 


portance of auditory-to- 


need to make four di 


e basic to the reading 


within a 
sense modality. Auditory discrimina- 
tions require the child to order and 
relate sound patterns organized on a 
temporal basis. Visual discriminations 
require the child to order and relate 
graphic patterns organized on a spatial 
basis. Two kinds of integrations be- 
tween the auditory and visual senses 
are required. Preparatory to reading, 
the child must relate auditory pat. 
terns in speech, which are temporally 
ordered, to the spatially ordered visual 
patterns in print. To actually read, 
he must reverse the process by re- 


* Report based on data gathered in second 


author’s PhD dissertation, University of 
Towa, 1965. 


isual pat 

sponding to the printed kc Sos 

terns with appropriate 50 | 

quences, ca a caw мав 

By the time the child is xperienc? 

introduced to reading, his = Чегар 
has equipped him with con 


on within 
skill in integrating information igual 
and between the auditory ат ch 


Senses. He can speak a language org 
assumes an ability to psit b 
integrations; he can dignum. e a 
jects, which assumes visus. ао A 
tions; and he can name obj joh 
asked or point to them if — | aud 
assume visual-to-auditory 229 | nec 
(ory to-visual integrations, " 
tively. ired sl 

Are these ехрегіепсе-асаш nands 0 
related to the more specific deeper" 
learning to read? Are they a own ^ 
ent of IQ? Research has a dito? 
Positive relation between ounds , 
discrimination ability for 802 n 
words and reading at е ie 
dependent of both IQ and we pera 
1935; Goetzinger, D ы rela 
1960; Wepman, 1960). tion 2 
between visual discrimina 
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INTEGRATION OF SENSORY INFORMATION 


an ; 
ee appears to be dependent 
зв, e stimuli used and the re- 
testing. Па in discrimination 
stantial y ates (1926) reported a sub- 
miei. Ositive relation between word 
independ, апа middle-grade reading 
associati ent of IQ but only a slight 
was test = when discrimination ability 
Goins ( eee non-verbal materials. 
showed ił 58) and Barrett (1965) 
á ard verbal and nonver- 
A Ni ioc aar tasks predicted 
S dm = reading independent of IQ. 
“Visual dide studies, the so-called 
responses * T imination" tasks involved 
as well x copying and letter naming 
ing jee Studies employ- 
verbal D AME training with non- 
graphie m oms, 1958) and verbal- 
1964 one (Muehl, 1960; King, 
eginners m pones facilitation in 
Verba] ркы аы owing training with 
seii Seem with the ability 
lors а, tomaton between the 
ow n visual senses and reading 
Positive и eo evidence of à 
| ome relationship independent of 
Used in’ дип the methods and stimuli 
1964. rome ee (Birch and Belmont, 
Walter 65; Katz and Deutsch, 1963; 
1965," ?nd Doan, 1902; Williams, 
Th 
wee of the present study 
uie e these four sensory 
Rana skills using the same sub- 
mparable sanes of tasks employing 
Ocedures Я osea materials and 
Ak wo questions were asked: 
inte is the ability of a prereader 
Within grate or match information 
visual a between the auditory and 
tion a (b) What is the rela- 
and mers d these matching skills 
Subse ris of reading readiness and 
quent reading achievement? 


to 
aud 


W, 
Mteg 
lect, 
со; 
Dr 


Subjects METHOD 


A ва 
m " 
from elk of 119 subjects (Ss) was selected 
st-grade classrooms in two ele- 
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mentary schools in Iowa City. For the most 
part Ss came from middle-class homes. The 
Ss repeating a grade or with known visual 
or auditory anomalies were not included in 
the sample. CA ranged from 6-0 to 7-0 with 
a mean CA of 6-4. Following the beginning- 
of-the-year testing, 11 Ss transferred out of 
the school system leaving а total of 108 Ss 
for analyses involving the end-of-year tests. 


Testing Materials and Procedures 

‘our serial- or sequen- 
ks were used. When а 
atehed with a visual 
as designated V-V; 
A-A; auditory with 


Matching tasks: F 
tial-type matching tas 
visual stimulus was m 
stimulus, the task w: 
auditory with auditory, 
visual, A-V; and visual with auditory, V-A. 
The stimuli consisted of patterns of dots 
and dashes, & modification of the dot pat- 
terns used by Birch and Belmont (1964). 
The visual stimuli were drawn with India 
ink on white 3 X 5 cards. The auditory 
stimuli were presented with an electric 
telegraph key which was placed behind a 
screen so that the experimenter's (Es) 
operation of the key could not be observed 
by Ss. Table 1 shows the warm-up and test 
items in the four matching tasks. Within 
each of the four tasks shown in Table 1, six 
stimuli were arrayed in three levels of diffi- 
culty as determined by the length of the 
stimulus patterns. Items 1 and 2 are two- 
component stimulus pairs; items 3 and 4, 
three-component; and items 5 and 6, four- 
component pairs. The warm-up tasks, 
consisting of stimulus pairs of one and two 
components, preceded each of the four 
tasks. А 

The four matching tasks were arranged in 
in Order I received v-V, V-A, 
A-V and А-А; Ss in Order II receive У-У, 
A-V, V-A, and A-A. This attempt at partial 
counterbalancing was indicated by а pilot 
study. The pilot data showed that per- 
formance on task pairs containing an initial 
visual stimulus was about the same whether 

resented early or late in the 

sequence. performance on task 
pairs with an initial auditory stimulus, 
task, tended to improve 

he sequence. Since 
desired, the A-A 
a showed most 


of position, was placed 


later in t 


investigation. 
order to the two groups. . 
were colored stickers and 


Incentives ‹ 
name cards. The S picked а colored card 
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SrgGMAR MUEHL AND SHIRLEY KREMANEK 


TABLE 1 
Warm-Up AND Test ITEMS ror Four MATCHING Tasks 
Response® Response 
Visual-Visual Auditory-Visual 
Warm-Up Warm-Up 
1 = d li s 8 
2 + s 2. - — d 
b! pem d 3. & j 8 
Р є ss - s 4. - - d 
Test а i Test а 
1. s - з -- =. 
2. = = d 2. - - 8 
3. - - d 3. - == d 
4. =z A s 4. -- - q 
5. = “= 8 5. -- - == 
бз ыс, ы ете 8 6. к. = a s 
Visual-Auditory Auditory-Auditory 
Warm-Up Warm-Up 
I. à - d 1 : as d 
а E = 8 2 - - s 
. + Ы 8 3 “= А а 
4. а = d 4 -—À = 8 
Test Test 
[= and 8 1. .- - s 
2: - d XS x ub d 
B: m == 8 3. - 5 8 
ferus А d ja Agere AR š 
ES UAE: Sirt d By (ae ce pee d 
6. - = = 5 E 6. bc icai d 
ad = different; s = same. 


of stickers. 

The matching tasks were indivi 
administered by the second auth 
small room provided in each scho 
the child had been told about, the incentive 
materials, E introduced S to the Stimulus 
materials and the nature of the task. An 
introduction preceded the warm-up trials 
for each of the tasks. During the introduc- 
tory period, S was given the label of “dot” 
for (.) and “dash” or “line” for (-). He was 
taught to order the stimuli from left to 
right by being shown that each pattern 
must begin and end the same way in order 
to be considered the “same.” 

The function of each set of warm-up 
trials was to give S practice in making same- 
different judgments with similar stimuli and 
procedures used in the following matching 
task. Thus, in the A-V task, S would first 


idually 
or in a 
ol. After 


hear а pattern of dots and dashes, then see 
оп а card a pattern of dots and dashes. 
He would then be required to make a same- 
different comparison between the patterns. 
When the first pattern was visual, the card 
was always removed before the second pat- 
tern in the pair was presented. Thus the 
matching procedure was sequential in all 
tasks. 

„The E took S through the four warm-up 
trials for a given matching task once, show- 
ing S why he was wrong when he made an 
Incorrect judgment. Immediately following 
the warm-up trials, E gave the six test 
trials. 

At the end of each of the four tasks, S 
was told he was doing fine. The S's score 
WaS the number of correct same-different 
responses. 

Testing started September 14, 1964, be- 
fore pupils began formal Teading, and was 
completed on September 28, just as the top 
group in all classes started reading instruc- 


tion. Testing time ran about 25 minutes per 
child. 


тз 


Z 
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eene Readiness Test: In September, 
[эй кер чыз teachers administered The 
аспау, iroud Reading Readiness Profiles 
БОС. n & Stroud, 1956) as part of а 
та testing program. The test 
Makin m seven subtests: Using symbols, 
саттып bam discriminations (attention 
m UM uncontrolled), Using the 
Using > aking auditory discriminations, 
ing ntext and auditory clues, and Giv- 
ees of letters. 
nig Test: The Lorge-Thorndike 
"horn dios Test, Primary Battery (Lorge & 
арас) 1957) was administered to the 
he! ela e classes in Mareh, 1965. Four of 
беа th were given the test by the 
second LUNES two classes by the 
man Tests: In May, the classroom 
Кы administered The Metropolitan 
Tone teen Tests, Primary Battery (Du- 
testin 9) as part of the school-wide 
serva € program. The reading subtest 
iment: as the eriterion for reading achieve- 


RESULTS 


woos Tasks: The matching 
enun were presented in two partially 
ance erbalanced orders. The perform- 
task results indicated that the 
orde means were similar regardless of 
fides The same was true for the V-A 
Wes S. Therefore, the two orders 
anal collapsed for the subsequent 
Som. dus The matching data were 
Pen d by means of & four-factor 
ре Ss (Lindquist, 1956)- 
and Cei factors included two within 
facto wo between factors. The within 
and rs were the four matching tasks 
dete the three difficulty levels 25 
oe by stimulus length. The 
evels en factors consisted of (WO. age 
of a mS sex. The age levels consist 
(CA Eo (CA 6-0 to 6-5) an high 

TS to 7-0) group. 
tenis oe proportionality for the 
Bian 13 Ss, 6 males and 7 females, 
Temaj: at random, were dropped. The 
Sex-age © 106 Ss were grouped into 

analyse categories for purposes 
es: young boys = 8 


TABLE 2 


Score DISTRIBUTIONS, MEANS AND 
STANDARD DEVIATIONS FOR Four 


MarcHING TASKS 


5 11 31 
j| 2| 35 
42| 29| 2 
46 38 12 


106 


187 | 1.15 | 1.08 


Task 
Number correct Ue] e Á 
vv | vA | AV | AA 
———À 
1 0 
2 0 
3 0 
4 6 
5 23 
6 77 
Total N 106 | 106 | 106 
M 5.07 | 5.20 4.81 | 4.05 
SD .58 


Note.—Collapsed over difficult 
age, and sex. 


y level, 


W = 32); young girls 
(N = 21); and older girls (N = 21). 


the distribution of 


means, ап Ds 


ing tasks for these 


he distribution of 


he tasks were mark- 
level of significance 


level. Th ue r d 
d signi cant ейес ociai 
vede re 5, stimulus ic 


d with the matchi 
i. < 001 


) The 


h interaction p E 
leng” 8 collapsed over 


e presented in 


Е а А- 
AN е по reliable differences 


: е. 
sociated 5 6 
а Matching Task versus Reading Per- 
formance o maximize differences 
he reading Scores 


"ending ability t 
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TABLE 3 
INTERCORRELATIONS AMONG Foun 
MarcnurNG Tasks AND Шон 
AND Low READING ACHIEVE- 
MENT (N — 49) 


Intercorrelations 


Variables pM EU | 
2 3 4 5^ 

V-V task -16 | —.04 | .19 | .16 
V-A task 27 | .40* | .39* 
A-V task -25 | .52* 
A-A task .28 
Reading 

^ Тыз. 

*p < 05. 


were dichotomized into High and Low 
groups. The standard Scores on the 
Metropolitan reading test have a 
mean of 50 and SD of 10, The stand- 


€ 108 Ss available 
-year testing ranged from 30 
to 67 with a mean of 56.83. The Low 
group consisted of Ss whose standard 
Scores were 52 or lower, or the bottom 
23% of the Ss. Since a standard score 


itan norms, a 


P were actually 
above average in ability by this stand- 


top 22% of Ss. The Standard score of 
65 corresponds to the 95th percentile 
on the Metropolitan norms. Since it 
was desirable to control IQ differences 
between the High and Low groups, 
the groups were matched using the 
Lorge-Thorndike scores? In the proc- 
ess, four Low IQ Ss were dropped. 


*The Lorge-Thorndike score was not 
included as one of the predictor variables 
in the subsequent analyses since the testing 
date was in March as compared to Septem- 
ber for the matching and reading readiness 
tasks. Results including the IQ as a pre- 


dictor variable included in subsequent 
footnotes. 


The final S totals for the High and 
Low groups were 24 and 25, the mean 
IQs, 119.7 and 117.6, and the mean 
CAs, 6-4 and 6-5, respectively. The 
group differences in IQ and CA were 
not significant. The High group con- 
tained eight boys and 16 girls; the 
Low, 17 boys and eight girls. This 
distribution by sex differed reliably 
from chance (x? = 4.58, p < .05). 

To determine the relation between 
prereading matching task performance 
and extreme group reading perform- 
ance at the end of the year, a dis- 
criminant function analysis was used. 

is analysis resembles the multiple 
Tegression analysis except that it 
Permits a dichotomized dependent 
variable. The analysis program? de- 
letes those independent variables con- 
tributing least to the multiple corre- 
lation with no loss of predictive power. 
In the first analysis, the dichotomized 
reading group scores (1 for High; 0 
for Low) served as the criterion 
variable. The matching task scores 
served as predictor variables, . 

Table 3 shows the intercorrelations 
àmong the matching tasks and reading 
achievement. Table 4 shows the sum- 
mary of the discriminant function 
analysis. The four predictor variables 
yielded a final multiple correlation 
coefficient (R) of .58 with reading. 
Only A-V and V-A matching con- 
tributed significant beta weights to the 
final R.s In Table 5, the High = 
Low readers are divided into high an 
low performance on the A-V matching 
task. The table shows that only one 
of 23 High group readers had a low 

-У score, whereas over half of the 
ie 


з Program developed by Bechtoldt à 
University of Towa Computer Center for t 
BM 7044, using Fortran IV. ждө. үйө 

* Ап analysis using all 108 Ss which as 
cluded IQ and the four matching tasks at 
predictors yielded a final R of .45. Signifo 
betas were reported for V-A (.23) and 
(.34) matching. 
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TABLE 4 


Dis 
CH 
rac Toca FUNCTION ANALYSIS InvoLv- 
AND Hrg MarcniNG Tasks AS PREDICTORS 
GH AND Low READING ACHIEVE- 


MENT As CRITERION (N = 49) 
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TABLE 6 
DISCRIMINANT FUNCTION ANALYSIS INVOLV- 
ing Four Матсніхо Tasks Pius READING 
READINESS SUBTESTS AS PREDICTORS AND 
Нісн AND Low READING ACHIEVEMENT 
as Crrrertion (N = 49) 


Predic 
to; i 
г variables | Initial betas* | Final betas? 


У-Ү tash 
k : 

SEO | 
-V task = E 
A-A task a NT 

ABL 
= .60 
b 
PR = 58 
P< .05 


Low т 
AY RM were low scorers on the 
ship determine the degree of relation- 
the н qa the matching tasks and 
Whitin o CINE tests, and the 
readin ion of each to predicting 
analysis а second discriminant function 
readines was run. Scores from the seven 
ing зае subtests, plus the four match- 
le dich served as predictor variables. 
Served otomized reading scores again 
Foi the criterion variable. Table 
Analysj marizes the results of this 
sis. With all but two tests de- 


Distrig TABLE 5 
отон or Hrau ann Low READERS 
Е м A-V Marcnine Task 


Low A-V | High A-V 


scores (1-4|scores (5-6 Total 
correct) | correct 
H 
ow padera 1 23 24 
а! 
Wt] 5 p T 25 


lete, 
(Using context and Giving 


Dam, 

Of the of letters), the final R was .83. 

Sl&nifie two, only letter naming had & 

namin, ant beta.5 Since the letter- 

bution test made such a large contri- 
© predicting reading perform- 


6 
An 
° апа i i 
ludeg ?nalysis using all 108 Ss which in- 
» readiness subtests, а hie 


" m^ Initial | Final 
Predictor variables betas" | betas? 
Harrison-Stroud subtests 
Using symbols 
Making visual discriminations 
Attention span controlle 
Attention span uncontrolled 
Using the context |, : 15 
Making auditory discriminations 
Using context, & auditory clues 
Giving the names of letters .79* 
Matching tasks 
у-у task B 
V-A task —.04 
A-V task 14 
А-А task —.01 
aR = .86. 
bR = „83. 
‘p< .05. 
ance, it was of interest to examine the 
between this variable and 


correlations 


the four ma The V-A and 


tching tasks. 


TABLE 7 
TION OF HIGH AND Low READERS 


DisTRIBU' 
ом LETTER-NAMING SUBTEST 


Giving names of letters 


Lowest Highest 
50%. (score! $005 (score Total 
<11) 218 


High readers 0 5 s 
Low readers 13 ЕА 
Total 13 


ў igni tly with 
-V tasks correlated significan 
n naming iD the extreme groups, 


i ¿s as predictors yielded simple 
SUM M taake M Mading as follows: Read- 


ons wi К кушы 
sin 

(28), Vist 

d (24), Using 
Auditory discriminations 
context and auditory clues 
ming (57); Matching tasks— 
30), А-У (40), А-А (325); 
al gnificant 
for Using symbols (A7), 


(46), V- 


Letter naming 


(17). 
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.53 respectively. In Table 7, 
ie TEN and Low readers are divided 
into groups with high and low per- 
formance on the letter-naming sub- 
test. As can be seen, the distribution is 
almost identical to the one obtained 
using A-V matching-task performance 
(see Table 5). 


Discussion 


The first purpose of the present 
study was to investigate prereaders’ 
ability to match information within 
and between the auditory and visual 
sense modalities. The results revealed 
marked differences in first-graders? 
ability to make "same-different" com. 
parisons between dot-dash patterns 
presented as auditory, visual, and 
cross-modal pairs (see Table 2), 
Matching visual pairs was easy for 
most children. Out of six test items, the 
majority of children made a perfect 
Score. None got few 
correct, Matchin: i 
the most difficul 
one-third scor 
Matching visu: 
tory-to-visual 
in difficulty, 
proving the more difficult of the two. 


The second purpose of the study 
was to investigate the relationship 
between these 


four Sensory matching 
skills and reading readiness and sub- 


Sequent reading achievement, When 
extreme reading groups (based on 
end-of-year tests) were equated in 
IQ, the analysis showed that 
beginning-of-the-year ability to match 
visual-to-auditory (V-A) and auditory- 
to-visual (A-V) pairs made Significant 
contributions to predicting reading. 


8 Since the elements within an auditory 
stimulus pattern cannot be presented si- 
multaneously as can the elements within 
& visual pattern, the auditory pattern is 
probably more difficult to perceive and re- 
call, thus making comparisons involving 
these stimuli less accurate. 
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Table 5 shows that only one of 14 Ss 
who had a low A-V score at the be- 
ginning of the year appeared in the 
high reading group at the end of the 
year. Of those Ss with high A-V 
Scores, twice as many ended up in the 
high reading group. Thus, for children 
used in this study, the general ability 
to relate information from the audi- 
tory to the visual sense, with IQ «d 
trolled, was markedly associated wit 
later reading achievement. However, 
Since some of the better A-V perform- 
ers fell into the low reading Sour Ч 
the skill is clearly not the only yan 
involved. The relationship of t = 
ability to relate information betwee 
the auditory and visual sense nt 
reading found in this study is ue 
sistent with the results of previously 
cited research. s did 

Ability to match visual pairs € " 
not contribute to predicting roe 
achievement. This result was to y 
expected since the task was 80 "v 
as to fail to yield a ae 
performance differences. Audi dis- 
matching ability yielded the best 
tribution of performance scores 2). 
four matching tasks (see Table kill 
Yet individual differences in this S7 
made no independent contribution. * 
predicting reading achievement. re- 
finding is inconsistent with аа 
Search evidence and at odds "ке 
Pedagogical opinions pegar 
importance of this skill in read! 
2nd early reading instruction. 
failure to find a relationship in use 
Present study may be due to ear 
of non-verbal materials. P Baer, 
studies (Goetzinger, Dirks and ken 
1960; Wepman, 1960) used Sp ing 
Words as auditory stimuli. аа 
тау well require a more specia КИ 
type of auditory discrimination em 
developed only with verbal wegen 5 
The lack of relationship coul high 
be attributed to the relativel aren 
level of language ability of the c 


the 
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Бе the study (as testified to 
may b IQ and reading scores). It 
ЖШ that only a given level of 
ен x4 discrimination skill is pre- 
that : ws for learning to read; and 
in most s level was already present 
dide of these children as an aspect 

If oe language skill. 
dotted s second line of reasoning is 
Фазе the results indicate that 
groun T be a good language back- 
Structio о not „need extensive in- 
Кли» nal exercises to make them 
patter us of differences between sound 
: m Rather, the emphasis should 
wee toward making them 
relation of auditory patterns in 
pai to visual-graphic patterns. 
that Ге this view would also suggest 
marked children come to school with 
guage E cepe ee in general lan- 
ihe ility, then training in auditory 
enne on would be important 
(1953) * 1964). Durrell and Murphy's 
Was |] findings that auditory trainine 
a Ness in improving 1680108 
in thi ement only for children poorest 
5 skill lends support to this view. 
tasks d the four sensory matching 
ге, а. combined with the reading 
Year y у to predict end-of- 
extreme ing achievement for the 
name ESA groups, the ability to 
tributio ers made the greatest con- 
ent with by far. This result is consist 
(1958) findings reported by Durrell 
iy E amet (1965). What is 
ras in the ability to name letters 
Teadin counts for the relation with 
tha, 8? The present findings indicated 
» in part, letter-naming ability 


refle, 

"elato з a more general ability to 

Ory nformation between the audi- 
odalities. 


Or 

This agi visual sense m 
ioci borne out by the significant 
ions between the A-V and V-A 
owever nd the letter-naming test. 
Mark T, at the same time, there were 
tions of differences among the correla- 
these three tasks with reading 


Corr, 
ота 
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achievement. Letter naming correlated 
.82 with extreme group reading 
achievement with IQ controlled; the 
A-V and V-A tasks correlated .52 and 
.39, respectively. Although test relia- 
bility attenuated to some degree the 
latter coefficients, it is likely that 
other factors contributed to the addi- 
tional predictive power of the letter- 
naming task. For example, the ability 
to name letters may aid in labeling, 
which would be assumed to facilitate 
discrimination and recall of printed 
words. The sound similarity between 
letter names and the phonemes these 
letters represent in words may help in 
word analysis (Muehl, 1962). 

The findings of the present study and 
similar research appear to have prac- 
tical implications for the classroom. 
The evidence points to the need for 
early identification of children deficient 
in the ability to integrate information 
conveyed through the auditory and 
visual senses. Naming the letters of 
the alphabet provides a means for 
screening these children. Although it 

arily follow that train- 

ing i is the best way of 
giving children pene x pee Ти 
"torv-to-visual integrating 8 il- 
{шешп idence that such 


me evi 
be helpful. Durrell 


d that systematic instruc- 
d sounds in first 


d better reading achieve- 
ed incidental 


when compar to inci 
instruction in this skill. — , 
P owever e children 1n the pres- 
ent study. ‘ved systematic 
instruction in es and e 
during the latter half of the kinder- 
ear. Thus, deficiencies 1n 
letter-name knowledge existed at first- 
ade level despite this instruction. 
Perhaps the ability to relate abstract 
from the auditory to the visual 
odality (€ dot-dash pat- 
rs and their names) is in 
maturational function. Birch 
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fford's (1963) findings relating 
wv epe. of intermodal equiv- 
alence for the visual, kinesthetic, and 
haptic senses support this view. They 
found growth curves similar to those 
reflecting anatomic development as in 
height and weight. 

This reasoning would suggest that 
children who are unable to learn the 
names of the letters probably will not 
be able to learn the more complex 
visual-auditory associations involving 
whole words. Thus a continuing readi- 
ness program should be provided these 
children until they indicate sufficient 


maturity to master the 


letter-naming 
task. 
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VALIDITY JUDGMENTS OF SY 
TEN YLLOGISMS IN REL: 
TO TWO SETS OF TERMS: B 
LAWRENCE T. FRASE 
University of Illinois? 


The separat 
tic reasoning were studied 
fiers rated by 64 8s on evalu 
of incongruous conclusions; 2 


e effects of affective and quantifying words upon syllogis- 
ina2 X 3 X 2 X 2 factorial design. Quali- 
intive scales were combined to yield 3 levels 
levels of quantifiers were used (univer- 
uate Ss received relevant syllogistic 


sal/particular). 32 undergrad 
t training. Results reflected the opera- 


training; 32 received irrelevan 

tion of 2 independent verbal res 
tive and quantifying terms 
the quantifier “some” pro 


produced significant 
duced gross effects and was sensitive to 
affective terms were relatively stable. 


ponse systems. Although both affec- 


iime and error effects, 


training while responses to 
tibility produced most errors. Initially, 


Moderate levels of incomp& 
Ss judged the validity of high 
after training they spen 


Consider the following syllogism: 
l good things are fine. 
h beautiful things are fine. 
erefore, all beautiful things are 


s idgments? 
clusion. ose that we change the com 
“g All beautiful things are evil” 
Does 1 me beautiful things are evil.” 
Oe c. conclusion now seem to follow 
Mises? Whos logically from the pre- 
Upon at is the effect of training 
error and time responses to the 


1 TI А 
Octoral Paper is based on the author's 
Oniversit issertation in education at the 
A fice of [4 of Illinois and was supported by 
TH funds aval Research Contract 3985 (04) 
пој ж Branted from the University of 
"gram, Yehology of Classroom Learning 


ow M 
at the University of Massachusetts- 


239 


t most time on t 
pe 


ly incompatible conclusions rapidly; 


hose conclusions. 


formal logical terms “МЇ? and 


“Some”? 

The formal syllogism is a verbal 
em-solving task in which two 
e classes of terms can be dis- 
tinguished. The first class of terms 
(all, some, no, not) determine the 
structural characteristics of the syllo- 
gism which are the sufficient conditions 
for judging the validity of the con- 
clusion. The second class of terms 
(syllogistic variables), whether X's, 
Y's, Or meaningful words, have no 
bearing on the validity of the syllo- 
gism, but may influence the correct- 
ness of an individual's inference. For 
instance, the effects of a neutral sen- 
tence “All X are Y” can be compared 
with the effects of emotional ia 

is such as “АП communists are 
e (Lefford, 1946), or state- 
ments which produce low or high be- 
Jief, such as, “АП men are women as 
opposed to “All men are males 
(Janis & Frick, 1943). | 

Mowrer (1954, 1960) has dealt with 
the relationship between verbal signs 
in a sentence as à means for producing 

nse, and Staats (1959) 


transfer of respon : 
has presented evidence which demon- 


strates that nonsense syllables acquire 


probl 
separat 
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negative or positive evaluative tone 
when paired with appropriate evalua- 
tive response terms (predicates). It is 
anticipated that when the terms to be 
associated elicit antagonistic responses, 
interference results. If such incompati- 
ble responses occur in the conclusion 
of a syllogism, reasoning should be 
disrupted. 

In support of the foregoing analysis 
are data relating to response latency. 
Berlyne (1960) found that it took 
significantly longer for Ss to rate 
pairs of words on Semantic Differential 
(SD) scales when the pairs rated were 
far apart in semantic distance. What 
Berlyne refers to as “conceptual con- 
flict” is similar to the concept em- 
ployed in the present experiment, 
referred to as “incompatibility (I).? 
Solarz (1960) also found supportive 
data; latencies of arm movements were 
shorter in response to stimuli which 
were compatible (elicited positive 
evaluative ratings) with approach 
movements. Hence, it is expected that 
combinations of incompatible words 
will increase judgment time. 

In regard to the formal stimuli, 
data from previous studies on the syllo- 
gism (Sells, 1936; Wilkins, 1928) 
indicate that Ss make more er- 
rors on particular conclusions (some, 
some...are not) than on universal 
conclusions (all, no). Although Ss have 
been told the definition of “some” 
before an experiment (to eliminate any 
systematic bias that might be intro- 
duced), it is not clearly the case that 
they actually do apply this governing 
definition. Studies on the effect of 
training on specific verbal factors in 
syllogistic reasoning have been absent 
from the literature. The assumption 
that a verbal introduction to logical 
terms overcomes the effects of those 
terms should be substantiated. 

The major interest of this paper is 
the relative and absolute effect of 

formal and affective terms upon reason- 
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ing both before and after training. 
In the present study, as opposed to 
earlier studies, predictions are made 
on the basis of an associative analysis 
of combinations of individual words 
rather than on the basis of entire 
sentences. 


METHOD 


Stimulus Words? 


A total of 44 qualifiers, such as “‘sinful,”’ 
"fearful," "successful," which were rela- 
tively “active” and "potent," and either 
"positive" or “negative” in evaluative 
polarization were selected from norms of 
English-speaking Ss (Osgood, Archer & 
Miron, 1962). These words were to be used 
as subject terms in the syllogisms. Predicate 
and middle terms were selected from other 
SD scales. The aim was to get a large variety 
of words so that many levels of incongruous 
word combinations could be formed. 


Rating Scales 


Four 7-point scales were used to evaluate 
the stimulus words selected as possible ea 
for the syllogisms. The scales, good- i 
dirty-clean, nice-awful, and heavenly 
hellish, were selected on the basis of relia- 
bility from the data of Osgood, Archer, an 
Miron (1962). Four sets of nonevaluative 
scales were used as filler items. Four sets 0 
words and scales were presented per page 


with scale directions alternated to eliminate 
bias. 


Response Incompatibility (Г) 


The range of possible scores for each уб 
was from 4 to 28. The formula for Z with ™ 
firmative conclusions was the absolute vins 
of the difference between the mean rating 


of the words (W) comprising the conclusion: 


“a ji 

I, = /Wi — W2/ Ш 
For instance, if “mother” were rated 28 = 
“evil” rated 4, then the universal affirma 
proposition “All mothers are evil’? wou 
receive a rating of 24, that is, 


I, = /28 — 4/ = 24. 


3 The complete report of these words and 
syllogisms used is deposited with Univers! 7 
Microfilms, Ine., 313 North First Street, 
Ann Arbor, Michigan. 
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In th 
were to ^ toed where word combinations 
formula was in negative propositions the 


Ty = (Г — Wal — 24/. [2] 


Th 

e "т 

Nous PAR *No mothers are evil" 
us receive an 7 value of zero. 


For 
Sure M 1 and 2 are analogous to “ргез- 
ie MAE used by Osgood and 
Need ed t except that I is ex- 
this beca units more convenient for 


Test Syllogisms 


Т} 
levels of I. E syllogisms were used at three 
ina и pe of 8 syllogisms terminating 
erminatine i conclusion had its counterpart 
Universal Bana particular conclusion. These 
matched nena particular syllogisms were 
rules they terms of validity and the logical 
test sila violated. There was 2 total of 48 
mean 7 gisms, 12 of which were valid. The 
three leye pines for conclusions used at the 
ee a and 20.6. There 
REKE erlap between categories. 
have an imples, the following two syllogisms 
in terms value of .71 (low) and differ only 
usion of the quantifier used in the con- 
All si 
al things are cruel 
haven disgusting things & 
ШАГ all disgusting thin 
and ^ things are cruel 
ever disgusting things ar 
e Ore, some disgusting 
Two ңе, 
Riven p ан I syllogisms (7 
aig OM illustrating a nega 


re sinful. 
gs are cruel. 


e sinful. 
things are 


= 13.9) are 
tive conclu- 


Some ; 
esi pL things are foolish 
Therefore., foolish things are fearful. 
в ing e, no fearful things are interest- 
оте į | 
aide айар things are foolish 
These tonhisn things are fearful. 
i teresting. fearful things are not in- 
Ehly i : 
qx V ae t gs combinations 
9 takural thing the following S. 
and n al things are creative 
Therefor natural things are su 
tive” no successful things а 


Suh; 
Ubjects 
Six: 
t ty- 
y-four undergraduate students from 


. -hlverg; 
train; Е pur of Illinois (with no previous 
formal logic) from 28 academic 


of words 
yllogism. 


ccessful. 
re crea- 


241 


areas were paid $1.00 per hour for ici 
3 s 1! participat- 
ing in the experiment. Grade levels ed 


from freshman to senior. 


Apparatus 

The computer-based teaching machi 
system SOCRATES (Training mh 
Laboratory, University of Illinois) was 
used to present the test syllogisms. See 
Merrill (1965) for references on this system. 
The Master I/O (input-output) stations 
associated with this system provide random 
access to any one of 1500 35-mm. filmed 
images. The filmed image is projected on a 
high-contrast rear-projection screen. A red 
response button was located on the desk top 
next to the I/O station which was located 
in a DuKane language laboratory booth. 

Two Master I/O stations were employed 
When 8 reported for а test 
to a machine and the logical 
«АП? and “some” was explained 
verbally according to the procedure reported 
by Sells (1936). The Ss were given à response 
booklet containing instructions on how to 


indicate the validity of each syllogism. The 
also repeated instruc- 


instruction booklet 

tions on the logical meaning of “some.” 

When S was ready to begin, or to proceed 
the red re- 


to a new tes 
sponse button. OCRATES 


randomly selected a test item for S, 
(on JBM cards) from the il- 


ing the time 
Jumination of the test frame 
sponse button was released. 


rds on SD evalua- 
valuative ratings of 
ds was used to select word com- 
t three levels of I. The resulting 


filmed. A 
] rating session 


after the initia! 
istic test over the 


time 


these wor 
binations а! 


6) 
ir judgments. 
me Ss were randomly 


ek later the sa 

oups- One group under- 
programmed instet 
:» syllogisti easonini in whic it was ex- 
sj syllogist Pt syllogistic variables (af- 
s) have no bearing upon the 
"v of a syllogism. The distinc- 
between formal validity and 
a conclusion. The other group 
irrelevant task (concerned with 
m for symbolizing proposi- 
of time. For this 


period 
ade to syllogistic 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS OF TIME AND ERROR SCORES 
FOR MAIN INDEPENDENT VARIABLES 


Time in seconds® Errors 

Variable Pretest Posttest Pretest Posttest 

M SD M SD M SD M SD 
ining> 

kc m 344.65 | 138.17 | 235.43 | 88.83 | 2.35 | 1.05 | 2.27 1.80 

Relevant 354.72 | 121.33 | 260.41 121.00 | 2.22 1.19 1.86 é 
и Low 356.51 | 128.03 | 253.30 108.79 | 2.06 1.25 2.02 | 0 
Moderate 355.89 | 145.50 | 258.17 | 111.50 2.47 1.33 2.10 En 

High 336.65 | 135.42 | 262.29 115.50 | 2.33 1.26 2.08 1. 
Conclusions* v 
Universal 323.32 | 118.00 | 243.86 | 99.32 1.52 1.09 1.54 d 
Partieular 376.05 | 149.93 | 271.98 120.00 | 3.06 1.44 2.59 1.5 


Note.—Time and error Scores re 


* Due to mechanical failure, 
available. 


^ Time, N = 28; Errors, N = 32. 
° Time, № = 56; Errors, N = 64. 


variables or affective terms. Both pro- 
grammed texts were developed by the same 
authors (Kearns, Odell, & Zartman, 1964). 


The Ss were not informed of the purpose of 
the present study. 


One week later all Ss retook the syllogistic 
test of 48 items. 
Design 


A2X3X2x2 factorial design (fixed 
factors) with repeated measures on the last 
three factors was used; Factor 1: relevant or 
irrelevant training; Factor 2: low, moderate, 
or high 7 conclusions; Factor 3: universal or 
particular conclusions; Factor 4: pretest 
versus posttest scores. 


RESULTS 


Time 


There was no significant main effect 
of I on time. There was a significant 
interaction between pre-posttest and 
I (F = 3.15, df = 2/108, p < .05), 
indicating that Ss initially made snap 
judgments on incompatible conclusions, 
but on the posttest they spent most 
time on those conclusions. The mean 
judgment times are reported in Table 
1, 


present the average for 8 Syllogisms. m 
time records for 4 Ss from each iraining group were 


Total time was reduced on НЕ 
posttest (Р = 49.1, df = 1/54, p d 
001). The means for particular an 
universal quantifiers were significantly 
different (F = 38.02, df = 1/54, p < 
001). The formal terms accounte 
for 27 % of the within-subject variance 
in time scores, while the affective 
terms accounted for 1% of the vari- 
ance.* 

Cumulative plots of time scores 
indicated no warm-up effects. 


Errors 


There were significant main effects 
of both affective and quantify m 
terms on errors. For Z the F was 3: 
(df = 2/124, p < .05). The F for 
quantifiers was 141.88 (df = 1/62, p P 
001). Multiple comparisons (N эл 
man-Keuls method) indicated | tha 
the moderate level of 7 differed signif З 
cantly (р < .05) from low J, but the 


* See Hays (1963) for the estimate af oe 
Proportional reduction in variance 0 
given X—est. w?. 


N- 


to 
= 
©з 
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aie at high J did not differ signifi- 
ed from either low or moderate 
tender means (Figure 1). There was a 
effect of toward | reduction in the 
до чт after training, but the inter- 
wasi not „зан pre-posttest and I 
2/124). ignificant (F = 1.85, df = 
Ns 2 summarizes the effects of 
and the and particular quantifiers 
Gon avn relative influence of training 
оаа Differences between group 
cant сна on quantifiers were signifi- 
on is for particular conclusions 
ps posttest, (p < .01). The train- 
action оцр-дпаћйег-рге/ posttest inter- 
.05) F was 4.33 (df = 1/02, p < 
ing › and the interaction F for tram- PRETEST 
1 Jog oup-qualifier was 8.90 (df = Fro. 2. Effect 
effects” > ok Although the specific made on universal and particular quantifiers 
tom mi of instruction are apparent for the group trained and the group not 
trained eros 2, over the total test E trained in syllogistic reasoning: 
тоир ws ior to the 
p was not superior — T 89 


Untra; 
rained group. On a later test (not а Formal terms а t ) 
-subject variance 1n error 


u 
o 


и 
PARTICULAR (UNTRAINED) 
UNIVERSAL (TRAINED) O77 -o 


40 405A, PARTICULAR (TRAINED) = Ac - --A 
` 


> 
о 


35 


3.0] 2.99 


POSTTEST 


ME 
AN NUMBER OF ERRORS ON SIXTEEN SYLLOGISMS 
7 


№ 
w 


of training upon errors 


Dar r t 
ins of the present analysis) the trained of the within 1 
P аашаа, scores (est. w). Affective terms ac- 

or 1% of the within-subject 


counted f 


vV 
2 
D 34 A 
с: А variance. " а 
3 32] As in the case of time ноз бше 
28 urin 2 
MET were no warm-up effects 5 
E : 
6 28 ing. 
ü 2.69 
= 38 Discussion 
2.4. men 
P tudy indi- 
g f the present stua, 
ee ep resenting instructions оп 
a 20 nad oe i Taning of “some” verbally 
ч. ical mi ajari c 
© ig P aom test instruction booklet 
È ig and ш sufficient condition for 
a ras not & > 
P alone Wa effectively control be- 
3 14 139 that E Ө айр which underwent 
= 12 havior. a d signifi- 
М] x showed signi 
(А struction 
S 10. relevant E - improvement on syllo- 
HN P 0 pU cantly grea ticular conclusions. The 
Low MODERATE HIGH : with partt i 
MODER gisms e hypothesis," which was 
" LEVEL OF I vatmosphery ais of Sells’ (1936) early 
G + . 
Vels Ti Mean number of errors at qus the persisted of 16 predictions (of 
а or syllogisms with universa. work, É M be сї 
А s would be chosen as 
hat conclusion 


л 
. Parti А 
Фер cular (P i he bar in- 

‚ Cate; ) conclusions. (The : й 
Чеши; the average of the universal and P^" valid) based on the quality and quan 


co " É 
nelusions combined.) 
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i remises. Of the 16 predic- 
athe Ade the basis of the atmos- 
phere hypothesis, 80% were particular 
conclusions. The results of the present 
study indicate that atmosphere predic- 
tions are, to some undetermined extent, 
confounded with E. effect of the 

ica finition of "some. 

II to note that there 
was no interaction between formal and 
affective terms on either time or errors. 
In terms of Osgood's (1963) model of 

the language process, affective re- 

sponses are associated with relatively 
stable interactions based upon past 
associations (of the stimuli involved) 
with similar or dissimilar objects or 
events. The results of the present 
study attest to the relative stability 
of affective interactions. The relatively 
large amount of variance accounted for 
by formal terms is also consistent with 
Osgood's statement that the effects of 
quantifiers should follow a multiplica- 
tive function. 
More time was spent on incompati- 
ble stimulus combinations on the 
posttest—a reversal of pretest results. 

If conceptual conflict, as outlined by 
Berlyne (1960), consistently functioned 
to increase decision time, one would 
not expect such a pre-posttest inter- 
action. When Ss have become sensi- 
tized to verbal stimuli, which pre- 
sumably occurred to some extent on 
the pretest, time scores tend to reflect 
efforts to overcome negative transfer 
effects. A theory of conceptual conflict 
might usefully take into account such 
interactions between stimulus famili- 
arity and response bias when complex 
relations are involved. . 

The present experiment provides for 
a comparison of two hypotheses: (a) 
that stimulus compatibilities increase 
acceptance of conclusions, and (b) 
that stimulus compatibilities facilitate 
reasoning. Because of the large number 
of invalid conclusions, an Increase in 
the acceptance of conclusions would 
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mean an increase in the number of 
errors, but fewest errors were made on 
conclusions with highly compatible 
terms. 

It was noted that there was no 
significant difference in overall errors 
made on the posttest between the 
group which received relevant train- 
ing and the group which did not. The 
trained group did improve significantly 
in dealing with particular propositions, 
and the results of their training showed 
up in a later test. It is apparent toai 
overall test results may not reflec 
improvement in reasoning which = 
precise instrument can reveal. * 
syllogistic reasoning, improvement p 
understanding the logical meaning © 
"some" foreshadows enhanced genera 

erformance. . 
i Results clearly revealed the opera 
tion of two response systems “a 
reasoning; responses controlled ed 
formal structural terms and n 
controlled by affective terms. ^w 
former were relatively sensitive S 
training and exhibited gross вза 
The latter were less influence à 
training and exhibited moderate "o 
Sponse characteristies. The Us T 
syllogism, which can easily be exte J in 
from the standard form employe e 
this study, might prove a useful e 
digm for studying the effects of pos 
variables upon the two response 
tems in verbal problem-solving. 
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DELAY OF GRATIFICATION AS A FUNCTION OF 
CHARACTERISTICS OF SOCIAL AGENTS: 


MARCIA J. LAWTON 
University of Nebraska College of Medicine 


Following an irrelevant task 171 1st and 2nd graders were individually 


offered, by a male or female stran, 
choice between an immediate, les 
preferred one. Data on such varia| 


time, self-control, and attitudes 


offering the choice 
aminer, and the for 
portant finding was th; 
10) fewer delayed re 
a finding that was ex; 


By the time a child is of school age, 
he is ready to be taught many new 
skills by significant adults other than 


his parents; that is, his teachers. Erik- 
son (1959) says 


good parents feel a need to 
dren trust their teachers, 
have teachers who can be 
leading to the development and the main- 
tenance in children of a sense of industry 
and of a Positive identification with these 


make their chil- 
and therefore to 
trusted . . . [thus] 


The question arises as to what com- 
ponents are involved in a teacher 
pil trusting relationship. 

The basie tenet of Mischel’ 
of-gratification approach, wh 
sists of offering children 
between an immediate, less 
reward and a delayed, preferr 
that the choice of a reward i 
tion of the expectancy of its occur- 
rence and the value of the reinforce- 
ment (Rotter, 1954). Initially, studies 
focused on the hypothesized 


-pu- 


s delay- 
ich con- 
a choice 
preferred 
ed one, is 
8 a func- 


decline in 


'Based on a doctoral dissertation sub- 
mitted to Northwestern University, June 
1963, and written under the helpful direc- 
tion of Lee B. Sechrest. The material on 
which this article is based was obtained 
from children in District 65 of the Evanston 
School System, which was extremely co. 
operative in assisting the author. 


Eer or & female student teacher, a 
s preferred reward and a delayed, 
bles as age, IQ, ability to estimate 


toward promise-keeping were in- 
vestigated in relationship to such characteristics 


of the social agent 


as sex of the examiner, experience with the ex- 
m in which the promise was given. The most im- 
at unfamiliar women elicited significantly (p = 
sponses than unfamiliar men or familiar women, 
plained in terms of trust, 
— 


the value of a delayed reward and the 
length of time interval, but in 1956 
Mahrer and in 1958 Mischel shifted 
the emphasis to an analysis of trust, 
operationally defined as the expec- 
tancy for a reinforcement to — 
à function of making a particu T 
choice with a particular social REE 
in a given situation, Empirically, p^ 
search in this area has attempted z 
clarify variables relevant to prefer 
ences for immediate reward or delaye 
reward. Mostly using children from 
other cultures as subjects (Ss), it dos 
been demonstrated that choice of à ¢ to 
layed reward is related positively 1 
intelligence (Melikian, 1959; Mische 
& Metzner, 1962), to the presence ln 
the father (Mischel, 1958, 19610), 
the need for achievement and soci ) 
responsibility (Mischel, 1961a, yid 
and negatively to acquiescence as 
chel, 1961a). Finally, research ч 
Shown no sex differences in pteferene 
for delayed reinforcement (Mische 
1961b), but it has shown that prefe > 
ence for delayed reinforcement a 
ently decreases as the length of ie 
inereases (Mischel & Metzner, = 
and increases with age up to a certa a 
level depending on the particular а 
ues used (Melikian, 1959). te 
Specifically, the present study s 
tempts to demonstrate the importance 
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of Ada 
soi araeteristios of the particular 
exami agent including the sex of the 
iner mau re with the exam- 
mer, oon À 

is i5 given, orm in which the prom- 


Sex 5 
of Examiner 


T Ms studies (Alden & Benton, 
ilie ut oltzman, 1952) have stressed 
ета Шїга of the sex of the exam- 
chel? fon e testing situation. Also Mis- 
nilaan] nd male children showed sig- 
with h Y greater delay preference 
x ie delay choices when 
hasmn s by a male experimenter. 
siderabl, as young children have con- 
Se Y more experience with 
up diff it is possible that they build 
ise-ke erential expectancies of prom- 
еріпр for men and women. 


Expos; 
тре | : 
Perience with Examiner 


Tes expectancies which a child has 
Bers Omise-keeping by perfect stran- 
from Fwd differ in important ways 
Shows nose he has towards people he 
Breator and with whom he has ha 

Were m contact. In this study promises 
and ц i de by known student teachers 

nknown graduate students. 


For 
™ of Promise 


T 
Promis forms, a paper note and an oral 
that it are compared. It is assume 
Tein for, а child is given a secondary 
Promiss in the form of a paper 
than €, this will be more effective 
e 8 Fein promise both because of 
decatine icitness of the “contract” ап 
ond, 8° Of the similarity to other sec- 
bridge PS rion ai which help to 

delays (Renner, 1964). 

‘umber ional data were obtained for а 
ap of characteristics thought on 
Tiori basis to be related: аре, 


TR aye 
Octobar, Michel, personal communication, 
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IQ, grade placement, attitudes toward 


promise-keeping, ability to control 
behavior, and ability to estimate time. 


METHOD 


Subjects 

A total of 171 Ss from five first grades 
and four second grades were employed in 
this study. The schools tended to be in a 
middle- to high-middle-class socioeconomic 
area and the overall IQ was 110. Of these 
children 124 came from intact homes while 
18 came from broken or adoptive homes and 
29 came from homes whose structure was 
unknown to the teachers filling out the 


forms. 


Examiners 

A total of 14 examiners were used who 
fell into three categories: four male stran- 
gers, four female strangers, and six female 
student teachers. The strangers were all 
graduate students in their early or mid- 
twenties enrolled in a clinical psychology 
training program. They were selected largely 
in terms of availability of time and willing- 
ness to participate. 


Form of Promise 
As a reward each S was offered a choice 
between two small balloons today (ImR) 
and one large balloon in a week (DelR). 
(See below under Choice for further ex- 
planation). The DelR was offered to half 
of the Ss orally and to the other half by 
note on which his name 


means of a paper nc ] 
and the date of delivery were written. 


Assignment 
The 171 Ss were randomly assigned to 


one of the following groups: 
. Male Stranger Ora 
Male Stranger Paper 


. Female Stranger ral 
. Female Stranger Paper 
Female Student Teacher Oral 
` Female Student Teacher Paper 
i ] each examiner saw about 10 
seven and 


э ә 


One ©: 


' examiner 
age of 84. 
f 72 to 102 months, but 


the 
limited to testing Ss 


ihe ov 


student teachers were 
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within their own classroom where the age 
range was somewhat narrower. 


Procedure 


S was seen individually for about 
15 EN by the examiner to which he was 
igned. Because of the distance to the 
Fue cse Jative to most of the class- 
testing room relative a 
rooms, it was necessary for the examiner to 
come to the classroom, meet the child there, 
and walk to the vesting room with him, 
5 i i neutral way. " 
шш кыным of the actual testing 
inv e parts: 
eiue 'The "stranger" examiner 
introduced himself as being someone (no 
names were used) interested in learning 
what TV programs children of this age most 
enjoy. The student teacher examiners ex- 
plained they were helping a person from the 
university interested in learning what TV 
programs children of this age most enjoy. 
2. Task Questions. The examiner then 
asked S three questions about his TV habits 
which were really irrelevant to the dimen- 
sion under investigation, delay of gratifica- 
tion, but which served as a reason for offer- 
ing 8 a choice of reward, 

3. Choice Situation. Next the examiner 

ofiered S a reward for his assistance; 
namely, either two small balloons today 
(ImR) or one large balloon in a week 
(DelR), half with and half without the 
paper note. School regulations necessitated 
changing the candy reinforcements such as 
Mischel used to balloons, which Gray* had 
found effective, A pilot study, based on in- 
dividual testing of 20 second graders, re- 
vealed that a choice of two small balloons 
immediately, versus one large balloon in a 
week, met Mischel’s criteria. That is, in a 
straight choice situation the larger balloon 
was chosen 100% of the time whereas under 
a delayed condition of 1 week half chose 
two small balloons immediately. 

In the testing situation S's choice was 
recorded (DelR or ImR). At this point 5 
was also asked to indicate the number of 
balloons which would have to be added (if 
a DelR was originally chosen) or deleted 
(if an ImR was originally chosen) for him 
to change his choice. For example, if S 
chose the large balloon, the examiner would 
add one, then two, then three, etc. small 
balloons to the two balloons already there 
until he said he would take the pile of small 
balloons instead of the large one. The num- 
ber of balloons added was recorded ( change- 


*Susan Gray, personal communication, 
November, 1962. 
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over score) and served as a measure of the 
strength of S's initial choice; that is, the 
more balloons which had to be added, the 
stronger his original desire for the big bal- 
loon was assumed to be. It was made clear 
to S that his original choice was the balloon 
or balloons that he would receive and that 


the later choices were only hypothetical 
ones. 


4. Promise Questionnaire. Fo 
making a choice, S was asked to answer 
four questions involving what it means to 
keep a promise and whether, in fact, he be- 
lieved that people tend to keep 


promises. 
Of these four, two were answered clearly 


enough that they could be used for statisti- 
cal comparisons. One plus or minus score 
was given for whether or not he believed 
others kept their promises and one for 
whether or not he mentioned some form 
of punishment coming to people who do 
not keep their promises, 

5. Time Questionnaire. This set of 10 
questions involved S's ability to estimate 
time, both short-term and long-term. For 
example: How long before you get out of 
school today and how long will it be before 
you are married? Each question was scored 
on a predetermined three-point rating scale 
(0 = substantial error, 1 = some error, 2 = 
substantial accuracy) of accuracy, On the 
basis of the overall distribution the accuracy 
Scores were then grouped into high (11 or 
above), middle (6-10), or low (0-5) ac- 
curacy scores which were used for computa- 
tional purposes, The communicability of the 
yed was checked on a 
Ss, for which а Pear- 
correlation of .99 was 


lowing his 


questionnaire cor- 
rected by the Spearman-Brown prediction 
formula was 75. 

After the testing was completed, the 
child was returned to his classroom where 
he deposited his two small balloons (if 
ImR was chosen) or paper note (if written 
DelR was chosen) in his locker outside the 
classroom. If an oral DelR was chosen, the 
child received his balloon 1 week later in а 
Eroup from his teacher. 

6. Other Measures. Each teacher was re- 
quested to furnish control ratings and back- 
ground information. Because of the paucity 


of material available on many questions it 
Was deemed advisabl 


е to use only the fol- 
lowing: 
Control ratings. The teacher's ratings on 
a fi 


Ve-point scale of S's ability to control 
pulses and delay gratification were to- 
taled across six items for each child. These 


im 
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included his ability to wait his turn, to 
share, to ask for something, etc. The possi- 
€ range of scores was from a low of six to 
a high of 30, A split-half reliability coefü- 
cient of .92, corrected by the Spearman- 
„own prediction formula, was obtained. 
16ге Were no appreciable differences be- 
tween the mean control ratings given by the 
nine teachers, as most of them hovered 
Rund àn overall mean of 20 with the range 
cmg from 17.1 to 25.6. 
Mig, occupation. The teacher indi- 
E H the father's Occupation in almost all 
ав, In three cases it was necessary to 
m dis the teacher's gross estimate of the 
7 economic class. Otherwise the Coleman 
appli Socioeconomic class rating scale жаз 
ere to the occupational information; 
n Was interscorer reliability on a random 
EIN of 40 Ss of .90. 
IQ e many of the classes quantitative 
tion or oe based on previous administra- 
able, H ormal intelligence tests were avail- 
available eb for 68 Ss there was no score 
furnish, T in 45 of these cases the teachers 
age hi, estimates (low, low-average, aver- 
child's Wb average, and superior) of the 
done Кашу, Analyses involving IQs were 
Ported Wice, once on the basis of the re- 
оп the Quantitative scores alone and once 
ibli asis of all the estimates combined. 
Supplied i. On the basis of the information 
Of fve oo the teacher, S was placed in one 
Siblings categories indicating the number of 
three 5 36 had. There were none, one, two, 
> and four or more, 


RESULTS 


e di Squares Were computed compar- 
layeq | number of immediate and de- 
Various а choices made under the 
examin conditions. The results for 
Poole ers in a particular group were 
On this nd analyses were computed 
nary te pooled data because prelimi- 
tween 9718 Involving comparisons be- 
ers wit hi Various individual exami- 
nis n each of the groups were 
Berg a 80ificant (four male stran- 
fema = 4.50, df = 3, p < .25; ten 
D < gcXaminers, y? = 8.97, df = 9, 
, dr | SX known examiners, X? = 
хаті 5, р < 15; eight unknown 
В а ddl. = 8.64, df = 7,p < .30). 

ed on itional analysis was per- 
changeover scores for vari- 
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TABLE 1 
Comparative NUMBER oF Сно1сЕз 


ACCORDING TO SEX AND EXPERIENCE 
WITH EXAMINER 


Examiner Immediate} Delay 
Male stranger 17 44 
Female stranger 26 30 
Female student teacher 15 39 


ous experimental groups in cases of 
children choosing DelR. The analysis 
of variance used on this measure in- 
volved a range of changeover scores 
from one to five. Specific results fol- 
low. 


Variable I (Sex of the Examiner) 


In analyzing the role played by the 
sex of the examiner, it was found that 
the chi-square (1.56) based on the 
number of immediate versus delayed 
choices (see Table 1) was not signifi- 
cant for male versus pooled female 
examiner groups (stranger and 
known). However, when the results 
were based on comparisons between 
male strangers and female strangers, 
the chi-square obtained (4.30) was 
significant (p < .05). The male stran- 
gers elicited significantly more delayed 
choices than did the female strangers. 
Further analysis of the data indicates 


TABLE 2 


CHANGEOVER Scores FROM DELAYED TO 
IMMEDIATE CHOICE 


Type of promise 
————— 
Ss Oral Paper 
N| X | SD|N! X | sp 
By sex 
Male 19 | 4.10 | 1.62 | 25 | 3.64 | 1.60 
Female 38 | 3.34 | 1.79 | 31 | 4.84 58 
By experience with 
examiner 
Known 20 | 4.30 | 1.15 | 19 | 4.95 23 
Unknown 37 | 3.21 | 1.86 | 37 | 3.97 47 


250 


that there was no interaction effect 
between the sex of the examiner and 
the sex of the subject G? = 2.87). 

The top portion of Table 2 contains 
changeover scores for the two sexes of 
examiners. An analysis of variance 
yielded nonsignificant F values (F = 
20, df = 1). 


Variable II (Experience with the Ex- 
aminer) 


Both choice scores and changeover 
scores for female strangers versus fe- 
male student teachers produced sig- 
nificant differences. Examiners who 
were known to the children tended to 
elicit significantly more delayed 
choices than did examiners with 
whom the children had had no experi- 
ence, as shown in Table 1 (y? = 4.04, 
df = 1, p < .25). Furthermore, once 
the child decided to wait, more in- 
centive (small balloons) was required 
with known examiners than with un- 
known examiners, as shown in Table 
2 (F = 4.76, df = 1, p < .05). 


Variables I and II Combined 


The combined efforts of these two 
variables, sex of the examiner and 
experience with the examiner, were 
analyzed with respect to number of 
choices and changeover scores, A chi- 
square of 4.76 was obtained between 
the three groups (male strangers, fe- 
male strangers, and female student 
teachers) which was just significant 


TABLE 3 
CHANGEOVER Scores FROM DELAYED TO 
IuwEDIATE CHOICE ÁCCORDING TO SEX 
or SUBJECT 


Male S 


Female S 
Group 
NIX|SDIN|X|sp 
Male stranger 22 | 3.82 | 1.53 | 22 | 3.82 | 1.61 
Female stranger 15 | 3.40 | 1.91 | 15 | 3.07 | .60 
Female student 
teacher 18 | 4.01 | .99 | 21 | 4.62 87 
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at the .10 level of confidence (sce 
Table 1). In essence, children chose 
2% times as many delayed rewards 
as immediate ones when they were 
offered by either the male strangers 
(44 to 17) or the student teachers 
(39 to 15). However, when the offer 
came from the female strangers, there 
was a greater reluctance to choose de- 
layed rewards, the split being about 
as many immediate rewards as de- 
layed ones (26 immediate, 30 de- 
layed). Thus, it was the female 
strangers who formed the deviant 
group. 

When an analysis of variance was 
performed on the changeover scores 
obtained by the three groups a signifi- 
cant F of 6.86 (beyond .01 level of 
confidence) was derived (see Table 
3). A Duncan multiple-range test was 
applied to the data and it revealed 
that each of the three groups differed 
significantly from the others. Thus, 
female student teachers obtained sig- 
nificantly higher changeover scores 
(M = 9.23) than did male strangers 
(M = 7.64) who, in turn, obtained 
significantly higher ones than did fe- 
male strangers (6.47). Therefore, it 
appears that Ss were most willing to 
wait with student teachers, less willing 
to wait with male strangers, and least 
willing to wait with female strangers. 


Variable III (Form of Promise) 


The difference between the number 
of immediate and delayed rewards ob- 
tained under the oral promise condi- 
tion and the paper promise condition 
Was nonsignificant when computed for 
xX (X = .002). In fact, the exact 
number of immediate and delayed re- 
wards chosen under the oral (29 and 
57) and paper (29 and 56) conditions 
was almost identical. Similarly, 8 
separate analysis for the six combina- 
tions for examiner and form of promise 
was nonsignificant (see Table 4; х” 
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ie = df = 5, p < .20). An analysis 
EBs riance of changeover scores (see 
in ^ 2) yielded nonsignificant F's 
m he form of promise when analyzed 
(p oss experience with the examiners 
= 0.00). 
ко, was a significant interaction 
ens ‚ on changeover scores for sex of 
dente and form of promise (see 
"n е 2; P = 3.81, p < .05). Observa- 
o of the data indieates that male 
mcm tended to elicit higher 
e pea scores under the oral con- 
"om than did female examiners, 
die n female examiners tended to 
ires much higher changeover Scores 
the m He paper condition than did 
Brie e examiners. Thus, for female 
stren ма alone, a paper note 
tà wy hens the children's willingness 
is ше deum reward. 9 

у y chi-square of other 
е (attitude toward promise 
s] Loe time estimation, ratings of 
ше) dh IQ, age, and grade place- 
ences revealed no significant. differ- 
(уз der that first graders differed 
choice 97; p « .05) on the type of 
Tale аа when it was offered by 
gers. si and by female stran- 
the dela, ey were more willing to choose 
by a m yed reward when it was offered 
male Ra e stranger than when by a fe- 
hold foe ees This finding did not 
signi tma E graders. There was no 
obtained difference in the choices 

by female strangers as con- 


Numam TABLE 4 

ER OF 

IN те IMMEDIATE AND DeLay CHOICES 
MACH EXPERIMENTAL CONDITION 


Fon 
™ of а 
Promise and examiner |Immediate| Delay 


Маје ога] 
ЖЕ male oral 10 19 
à ü 
маец teaeher oral 13 18 
Fen Paper 2 6 20 
Stug € pa Т 2 
Udent eect 13 12 
acher paper 9 19 
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trasted with female student teachers 
when compared separately by grades. 


DISCUSSION 


Results of this investigation show 
that the probability of an S choosing 
an immediate or a delayed gratifica- 
tion is a function of the characteristics 
of the person offering the choice and 
making the promise. The author pre- 
fers to view the response of the child 
in terms of “trust,” but there are cer- 
tainly alternatives. When S is placed 
in situations resembling the immedi- 
ate-delayed reward choice, his re- 
sponse will reflect, among other things, 
the expectation he has that the prom- 
ised larger reward will, in fact, be 
delivered. That aspect of such situa- 
tions can be separated conceptually 
from S's ability or inability to evalu- 
ate the period of delay, the relative 
worths of the rewards offered, ete. It 
is just that expectancy of fulfillment 
that defines trust as a response. 

The most important finding here is 
that unfamiliar women are trusted 
less than either familiar women or un- 
familiar men. Of course it must be 
recognized that "familiarity" is, in 
this investigation, a rather gross de- 
scription that is confounded with other 
variables such as teacher-nonteacher 
status and the like. Nonetheless, it 
appears to the writer that the strange- 
familiar dimension must be the most 
salient one for first- and second-grade 
children taken from their classrooms 
by an adult. It does not seem surpris- 
ing that children should be more will- 
ing to risk delay of gratifieation more 
with their own teacher than with a 
strange female. What is more difficult 
to comprehend is why delay is more 
acceptable when the agent is a strange 
male. At present we can do no more 
than speculate, but two ideas that 
present themselves are that: (a) 
Males are generally more prestigeful 
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in an elementary school situation and, 
pee may be more trustworthy, and 
(b) the nature of a child's interactions 
with male and female figures almost 
insures that the female will have had 
more opportunity to break promises, 
for example, mothers must, perforce, 
break what were to the child consid- 
ered ironclad agreements. = 
One additional—and intriguing— 
possibility is that for most children 
the choice of an immediate reward 
represents a self-indulgence which 
they are not always permitted. That 
is, we may put the emphasis on the 
gratification of an immediate reward 
rather than upon the somewhat puri- 
tanical promise of pleasures to come. 
In that case, we could ask what kinds 
of persons are likely to elicit a self- 
indulgent and perhaps immature re- 
sponse from a child. The writer does 
not find it a strain to suppose that 
the warm and friendly Strange female 
graduate students employed in this 
investigation would foster more self- 
indulgence in a child than either a 
teacher or a strange male. More re- 
search is called for on this point. 
The failure in this investigation to 
find a relationship between the age 
and IQ measures and reward choice 
1s not consistent with the findings of 
other investigators (Melikian, 1959; 
Mischel & Metzner, 1962), but the 
sample used here was quite homogene- 
ous with respect to age and probably 
with respect to intelligence also, 
The “paper promise” proved no 
more effective than the verbal promise 
in eliciting delay choices, a finding 
which might be explainable either in 
terms of the unfamiliarity of the chil- 
dren with promissory notes or in terms 
of a possibility that trust was not 
really so much at issue. It is of inter- 
est that the use of the note increased 
delay choices when the agent was a 
strange female, but that finding will 
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need elucidation in other research be- 
fore it becomes clearly interpretable. 
It makes sense, however, as a manifes- 
tation of bolstering trust. 
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REFINEMENT AND TEST OF THE THEORY OF FLUID 
AND CRYSTALLIZED GENERAL INTELLIGENCES 


JOHN L. HORN 
University of Denver 


AND 


RAYMOND B. CATTELL 
University of Illinois 


The 23 factors previously identified as representing primary mental 
abilities and 8 factors previously defined as general personality dimen- 
sions were factored, using à sample of 297 adults, to provide evidence 


for hypotheses stipulating that general visualization, fluency, and 
speediness functions, as well as fluid and crystallized intelligence func- 
tions, are involved in the performances commonly said to indicate in- 
telligence. 9 principal axes factors were sufficient to account for the 


observed, generally positive, intercorrelations among the 31 primary 
factors. These were rotated blindly to oblique simple structure. The 


resulting structure was consistent with predictions based upon refine- 


ments of the general theory of fluid and crystallized intelligence. Posi- 
tive manifold for the intercorrelations among the 2nd-order factors 
was interpreted as indicating a social fact of interdependence between 
intraperson and environmental influences determining behavioral at- 


tributes. 


“Pe theory of fluid and crystallized 
Eun te (Cattell, 1957, 1963; 
ogg) 1988; Horn & Cattell, 1966a, 
m ) seriously questions the notion 
sig, em is a unitary structure 
tel can be designated general in- 
wor е It questions the belief— 
in n implicit, but expressed clearly 
(1964 recent article by MeNemar 
ured Ска the conglomerate meas- 
collecti combining subscores from а 
est ae of intellectual tests is the 
ox Stimate of intelligence. But the 
at A also questions the contention 
cally | is theoretically and practi- 
of У 12056 worthwhile (at this stage 
“splint opment i this field) to 
into my the intellectual domain 
slight} very large number of narrow, 
Y distinct abilities, as in the 
work by Guilford (1959, 

1 erifield his coworkers (Guilford & 
issue = , 1960). The theory takes 
Benera] ith the popular belief that 
les ar Verbal and quantitative abil- 
Sen; 19 truly general, that is, repre- 
jo b, unitary concepts at а 
Vel of organization of all hu- 


BA ordo 
abilities. And although the 
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theory is in some respects similar 
to theories stipulating a distinction 
between general verbal-education 
(V:ed) and general practical-me- 
chanieal (k:m) abilities (Burt, 1949; 
Vernon, 1950) it differs from these 
conceptions with respect to both: (a) 
specification of the abilities which go 
into the major dimensions and (b) 
the development of abilities in child- 
hood and the adult years. Looked at 
specifically in terms of the concerns 
of this study, the theory of fluid 
and crystallized intelligence contains 
clear-cut statements about the ex- 
pected higher-order factorial struc- 
ture among primary mental abilities, 
that is, abilities of the type defined 
initially in Thurstone’s (1938) pio- 
neering researches and subsequently 
in many other investigations using 
similar methods (summarized by 
French, 1951; French, Ekstrom, 
& Price, 1963; Guilford, 1959, 1966; 
Guilford & Merrifield, 1960). 

The fluid-crystallized theory ar- 
gues that the primary abilities 
which can be said to involve intelli- 
gence to any considerable degree are 
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organized at a general level into two 
principal classes or dimensions. One 
of these, referred to as fluid intelli- 
gence (abbreviated Gf) is said to 
be the major measurable outcome of 
the influence of biological factors on 
intellectual development—that is, 
heredity, injury to the central nerv- 
ous system (CNS) or to basic sen- 
sory structures, etc. The other broad 
dimension, designated crystallized in- 
telligence (abbreviated Ge), is said 
to be the principal manifestation of 
a unitariness in the influence of ex- 
periential-educative-acculturation in- 
fluences. Each dimension is, according 
to theory, so pervasive relative to 
other ability structures and so ob- 
viously of an intellectual nature that 
each deserves the name "intelligence." 

Studies having particular relevance 

for the present research may be 
grouped into three classes: (a) fac- 
tor analytie investigations at what 
may be regarded as the second (or 
higher) order among primary ability 
factors, (b) factor analyses of the 
Wechsler scales, and (c) results from 
work employing hierarchical methods 
for defining dimensions among abil- 
ity performances. Horn (1965b) has 
reviewed the findings from studies 
in these various areas and has 
pointed out that these imply refine- 
ment of the general theory. Some of 
the prineipal points of refinement, 
and thus some of the principal hy- 
potheses of the present study, may 
be summarized as follows: 

Gf and Ge should be seen as only 
two among several general influences 
manifested in performance on puta- 
tive tests of intelligence. Among these 
other influences are: 

1. General visualization (Gv), a 
process involving visual acuity, sen- 
sitivity to light, breadth and width 
of visual field, depth perception, ete., 
and manifested in speeded perform- 
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ances of visualizing the movements 
and transformations of spatial pat- 
terns, maintaining orientation with 
respect to objects in space, unifying 
disparate elements and locating a 
given configuration in a visual field. 
The hypothesis implied is that Gv 
will be strongly manifested through 
primary factors like Vz (Visualiza- 
tion), Cf (Flexibility of Closure), 
Cs (Speed of Closure), DFT (Figural 
Adaptive Flexibility) and S (Spatial 
Orientation), but that it will be in- 
volved to a lesser degree in the other 
primaries involving spatial materials, 
such as CFR (Figural Relations), 
CFC (Figural Classifications) and 
Mk (Mechanical Knowledge). 

2. General fluency (F), a process 
reflected in tasks indicating facility 
in recalling and recognizing labels 
for cultural concepts irrespective of 
the subtlety of understanding of 
these. The process could relate to the 
size of store of language elements 
and/or to factors determining quick- 
ness in bringing such elements from 
long-term memory into immediate 
awareness. In any case it is to be 
supposed that the process will be 
manifested strongly in those pri- 
maries—like Fa (Associated Flu- 
ency), Fi (Ideational Fluency) and 
Fw (Word Fluency)—which re- 
quire speedy production of concept 
labels in a loosely structured area of 
meaning, but it will appear, if more 
faintly, in primaries like V (Verbal 
Comprehension) and EMS (Experi- 
ential Evaluation), since these re- 
quire some facility in recall of con- 
cept labels although the principal 
requirement of the tasks is not this. 

З. General Speediness (Gs), а 
quickness of performance involved to 
some extent in most speeded intellec- 
tual tasks but independent of the 
above-mentioned functions and not 
indicative of the capacity to perceive 
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үе. This quickness is expected 
sis E n most clearly in very sim- 
Rud asks, such as those of the pri- 
N (Ni P (Perceptual Speed) and 
or. umber Facility), but the func- 
eae be related, also, to the at- 
to de self-evaluations, etc., found 
UL. Sune UI-16 (Assertive Ego) and 
a (Exuberance). 
am same space as the above di- 
sent ona; fluid intelligence will repre- 
imi MAE of reasoning in the 
ing po Situation in tasks requir- 
and ва concept formation 
and ee and the perception 
ч, uction of relations. It will be 
Serres most purely when task 
lis гне | are culture fair; that is, 
all undaments are either novel for 
аза ТОН being measured ог else 
dance d common, overlearned 
Саны, of a culture, and the aids 
in а" 1963; Horn, 1965) needed 
&vailahl: solutions are not those made 
Portu ble, by favored educational op- 
B Prud to some persons and not 
hos ers among those measured. 
be m it can be predicted that Gf will 
Stee Кезец in the reasoning pri- 
CMR aoe CFC, I (Induction) , 
era] emantic Relations), R (Gen- 
duae NE, and Rs (Formal 
rimas 18) and it will appear in 
ain res requiring the subject to re- 
ness ements in immediate aware- 
Ma’ Such as Ms (Memory Span), 
Ssociative Memory), and the 


Sim х 
Ple Clerical Speed primaries, P 


and N. 
с the same level of generality as 
Set of į Per ce an equally broad 
izeq in ellectual tasks, the crystal- 
Cate py elligence function will indi- 
ety Ww of awareness and sub- 
Concepts elations previously perceived, 

indic previously attained, etc., 
"cop iti ated in tasks requiring 
tions ies or recall of such rela- 

> Dut it will also represent abil- 


ity to reason in the immediate situa- 
tion in tasks requiring abstracting 
concept formation and attainment, 
perception, and eduction of rela- 
tions. Thus, in contrast to Gf, Ge will 
be measured most purely under con- 
ditions in which the subject must use 
concepts and aids representing rela- 
tively abstruse elements of the collec- 
tive intelligence of a culture. There- 
fore Ge will be manifested in “recall” 
primaries, like V (Verbal Compre- 
hension) and Mk (Mechanical Knowl- 
edge), and also in reasoning prima- 
ries, like CMR, CMC, R, Rs, J 
(Judgment), and EMS (Experiential 
Evaluation). 

This development of the theory in- 
troduces a notion of alternative 
mechanisms in the performances that 
are commonly assumed to represent 
the operation of intelligence. That is, 
many intellectual tasks allow one to 
employ either fluid intelligence or 
crystallized intelligence to arrive at a 
correct answer. For example, in a 
problem like this: 

In a hospital containing 100 patients some 
are one-legged but wearing shoes and half 
of the remainder are barefoot. How many 
shoes are being worn? 

e an elementary in- 
largely representa- 
lligence or he may 
f formal algebra, 
the collective in- 


a person may us 
duction process 
tive of fluid inte 
use the techniques 0 


these being part of 
telligence of our culture and thus of 


Ge. A primary factor which involves 
tasks of this type, such as R, may 
therefore be expected to relate 
about as much to Gf as to Ge. On 
this basis, then, it can be predicted 
that the primaries R, CMR, CMC, 
Rs, and J will help to define both 
fluid and erystallized general intelli- 
gence, as suggested previously. On 
this basis, too, it can be predicted 
that primaries allowing for use of 
both visualization and fluid intelli- 
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gence processes, like CFR and CFC, 
will be involved in the factorial iden- 
tification of both Gv and Gf. zd 
larly, the fluency primaries may 
еей to relate to both Се and 
FE. H 

In the only previous study con- 
cerned specifically with the concepts 
of fluid and erystallized intelligence, 
Cattell (1963) found two second-or- 
der dimensions which could be inter- 
preted in accordance with hypotheses 
about the processes of Gf and Ge. 
However, the sample of primaries 
in this study was of very limited 
range and not such that it would per- 
mit test of several of the hypotheses 
outlined or implied above. For ex- 
ample, the factor Cattell identified 
as Gf, involving CFR, CFC, DFT, 
and §, would seem (according to the 
above refinements) to be confounded 
with Gv; the factor he interpreted 
as Ge, defined by V, N, and I was 
narrow relative to the general hy- 
pothesis stated above; the hypothesis 
of alternative mechanisms could not 
be examined and no Gs or F factor 
was indicated. It is of some interest 
therefore, to see if the theory can 
be generalized in the way suggested 
here. 

More than this, it is of some in- 
terest to explore the second order fac- 
torial structure of a broad sample of 
primary ability factors whether or 
not the Gf-Ge theory is taken as the 
motivation for this. For although 
there have been several factorial 
studies at what can be regarded as 
the second order, as reviewed by 
Horn (1965b), none of these em- 
ployed a representative sample of 
primary factors presently known and 
replicated, none was designed specifi- 
cally to reveal intricacies of higher 
order organization (with the result 
that adequate hyperplane material 
was usually lacking), and in most 
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cases the primaries themselves were 
rather narrowly defined. 

The major purposes of the present 
study are thus indicated: to illustrate 
the structure in a comprehensive 
sample of primary mental ability 
factors with the aim of determining 
whether or not this is generally con- 
sistent with the process descriptions 
outlined in the above refinement of 


the theory of fluid and crystallized 
intelligence. 


OPERATIONAL DEFINITIONS OF 
CRUCIAL VARIABLES 


Horn and Cattell (1965) have re- 
cently reemphasized the desirability 
of using more than one test to meas- 
ure and represent a primary factor. 
If a single test is used, the resulting 
measure is likely to represent the test 
specific factor to a considerable ex- 
tent and to contain substantial vari- 
ance on an unwanted vehicle factor. 
To heed this advice presents prob- 
lems when the aim, as in the present 
study, is representatively to sample 
from a broad domain of factors and 
yet testing time must, for various 
reasons, be limited to no more than 
five hours. In this study these prob- 
lems were solved, though not ideally, 
by using two or more tests to meas- 
ure most of the primaries having 
pointed relation to the major hy- 
potheses and measuring the remain- 
mg primaries with one test which 
previous research had shown to be & 
relatively pure marker for the factor. 

Cattell (1963) pointed out that in 
order for Gf and Gc general ability 
imensions to be uniquely defined by 
а simple structure it is quite neces- 
sary for an analysis to involve non- 
ability primary factors—to furnish 
hyperplanes against which to rotate 
the ability factors. Consideration 
must be given to sampling not only 
from the ability domain but also 
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from the domain of temperament 
ты general personality factors. In 
thi present investigation roughly one- 
À ird of the primaries included for 
analysis were taken from the Q-data 
iUm Or were general objective test 
actors believed to be large independ- 
m of the processes of intellectualiza- 
lon, as such. 
he tests used to achieve measure- 
x of the primary factors are listed 
ie 1 brief descriptions in Table 1. A 
labis following a variable title in the 
tek e indicates that the variable was 
tide, either from a test published 
Sh ег the authorship of the person 
study name is listed or else from a 
m Y reported in the research litera- 
libe, by this person. The abbrevia- 
use " for primary factors are those 
ben by French et al. (1963) if these 
ors list the factor. If they do not, 
ins the abbreviation is taken either 
LU French (1951), Guilford and 
errifield (1960), or Horn (196b). 
in n the listings defining the “Scor- 
e R means number right, W 
га number wrong, № means num- 
ne responses given in specified 
eub, а speeded open-ended test, K 
а gs 8 score determined according to 
bole ifie key, and the other sym- 
whidh eee indicate the way in 
= А variable was obtained from 
f оңа kinds of scores. Thus 20 — 
м a example, means that the num- 
Was op ong answers given in a test 
in ore tracted from the constant 20 
ure E to obtain a variable meas- 
dicate Which a high score would in- 
Wrong) good” performance (ie. few 
„9н of the reliability estimates 
Separate] Table 1 were based upon 
Tue ln Y-timed half-tests. This was 
Speedeq All eases where a test was 
include ge work time does not 
struction E time required to give 


Test scores were combined in the 
manner indicated to obtain the pri- 
mary factor scores that were actually 
analyzed. In each case the test vari- 
able was converted to standard score 
form and added in with unity nomi- 
nal weight. 

The symbols of the column to the 
far right in Table 1 indicate the ma- 
jor second-order hypotheses for each 
of the primary factor variables. The 
order of listing of these symbols in a 
partieular row indicates primacy— 
that is, the first-listed is considered 
to be the major hypothesis; the sec- 
ond indicates a lesser hypothesis. 
PSI in this column stands for a sec- 
ond-order positive self-image dimen- 
sion found among nonintellectual 
primaries (Horn, 1963). The other 
symbols were defined above. 

It is likely that most investigators 
will accept this sampling of primaries 
as reasonably representative of what 
is now known about first-order fac- 
tors. Of course, there are other 
fairly well-established primaries which 
could, with justification, have been 
included in this study. Neverthe- 
less the sample here employed is 
at least somewhat stratified to cover 
the area of cognitive abilities and, in 
any case, is the most extensive and 
representative yet achieved in this 
kind of study. 

METHOD 


Subjects 

The general theory argues that the in- 
fluences which produce the distinction be- 
tween Gf and Ge operate through develop- 
ment. As development proceeds, the Gf-Ge 
distinction becomes clearer. A detailed 
resentation of the facts and theoretical 
considerations which support this position 
has been provided elsewhere (Horn, 1965b). 
In general, however, this conclusion follows 
from the fact that although educative 
and nutritive influences are to some extent 
interdependent and mutually supportive, 
there is no necessary relationship between 
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Primary FACTOR VARIABLES AND TESTS USED FOR ESTIMATION 


TABLE 1 


Matrix 
number 


1 


2 


10 


il 


16 


TOR 
M ‘work e; i 
Primary ie PAR] Test ete and эше | ue | HE SREY) p 
onds) 
Induction $ Letter Grouping (Bot- | 67 240 24 Gf, Gs 
zum) 
Number Series (Bot- | 59 244 20 
zum) 
Intellectual Speed | 5Р | Test Am ешеш $$ | 200. | 20 i St Gs 
neaux, 
Carefulness c Test B(1) Series (Fur- | 57 200 20 20— W Gs(—) 
neaux) 
Figure Classify (Cat- 45 240 14 20— W 
tell III) 
Practical Estimates 420 20 20— W 
(Guilford) 
Subtracting (Horn) 75 60 20 9-W 
Dividing (Horn) 66 60 20 20— № 
Fractions-Decimals 180 20 20— 
(Horn) 
Intellectual Level | L Test B(2) Series (Fur- | 72 900 35 AR Gf 
neaux) 
Figural Relations | CFR | Figure Series (Cattell | 69 360 24 R Gf, Gv 
II, III) 
Topology (Cattell II, | 65 300 16 R 
ш) 
Matrices Speed (Cat- | 56 180 12 R 
tell II) 
Matrices Power (Cat- | 53 360 13 R 
tell III) 
Figure Classify (Cat- | 52 240 14 R 
tell П) 
General Reason- | R Problem Solving 70 600 12 R Gf, Ge 
ing (Horn) 
Adaptive Flexi- DFT | Match Arrangements | 09 420 20 R Gv, Gf 
bility (Guilford) 
Spatial Orienta- |8 Cards (Botzum) 86 240 54 n Gv 
tion 
Figures (Botzum) 85 | 240 52 R 
Visualization Vz Form Boards (Thur- | 81 240 28 R Gv 
stone) 
Associative Mem- | Ma Cued Nonsense Mem- | 68 280 20 R Gf 
ory ory (Horn) 
Cued Meaningful 87 288 26 R 
Memory (Horn) 
Semantic Rela- CMR | Common Word Anal- | 65 390 30 R Gf, Ge 
tions ogies (Horn) 
Abstruse Word Anal- | 67 390 30 R Ge, Gf 
ogies (Horn) 
Verbal Compre- | V Vocabulary (Horn) 93 | 900 50 R Gc 
hension 
General Information | 84 | 420 28 R 
(Horn) 
Mechanical Knowl-| Mk | Mechanical Informa- | 71 | 150 15 R Ge 
адве tion (AAF) 
Tool Identification 72 | 150 15 R 
(AAF) 
Formal Reasoning | Rs False Premises (Thur-| 41 360 22 R Ge, Gf 
stone) 
Influences (Guilford) | 66 480 14 R 
Experiential Eval-| EMS | Social Situations 61 360 16 R Ge 
uation (Guilford) И 
Associational Flu- | Fa Controlled Associa- 73 180 N F, Ge 
ancy tions (Thurstone) 
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Table 1—Continued 


9 


Matrix P bá Total 
numbe; rimary ti! rimary| > work 
T ary title Керр] Test title and source | tax | time Number Scoring Hypotheses. 
onds) 
17 | Ideati à Н 
deational Fluency] Fi Things Round (Tay- 180 N F, Ge 
lor) 
Wer up the 180 N 
18 pa dder) (Taylor) 
Number Facility | N Adding (Horn) "n 70 20 R Ge, Gs, Gf. 
Multiplying (Horn) 90 20 R 
Mixed Operations 150 | 2 R 
9 |g (Horn) 
Speed of Closure | Cs Backward Reading 89 105 40 R Gv, Gs 
(Botzum) 
Street Gestalt (Bot- 78 210 21 R 
20 Flexibili zum) 
e of Clo- | Cf Designs (Botzum) 85 | mo R-W-+20 | Gv, Gs 
21 
Speed of Copying | Sc Forward Writing (Cat- 30 N Gs 
tell) 
Forward Printing 30 N 
22 Writ; (Cattell) 
riting Flexi- Wt — | Backward Writing ss | 120 N Gs 
a |f (Cattell) 
ereeptual Speed | P Match Letters & 81 90 | 50 R Gs 
Numbers (Cattell) 
Rapid Cancellation 86 90 R 
24 (Cattell) 
Assentive Ego Ul6 | Agree With Plati- 75 | 120 | 20 K Gs 
tudes (Cattell) . 
Highbrow Tastes т | 180 | 2 к 
25 (Cattell) А 
Exuberance U21 | Experience Claimed 76 | 180 20 K Gs 
(Cattell) 
Confidence in Skill ss | 180 | 20 K 
26 (Cattell) И 
Self-Sentiment U36 Utilities Choice (Cat- 49 150 16 K Ge, PSI 
tell) " 
Autistic Distortion 43 150 16 
27 (Cattell) " 
Early Risks ER | Early Risks Claimed | 80 135 | 26 K PSI 
28 (Horn) x 
Unbound Anxiety | U24 | Test Anxiety Gara- | 82 о | 30 K PSI (-) 
son) 
20 16 PF. Q4 (Cattell) 
з | Adventurousness |H | 16 PF. H (pate | 73 | 190 | 2% Е Е 
aj |&Perimenting | Qı 16 РЕ. Qı (Cattell) ea | 150 | 20 E NA 
Sensitivity Ip | 16 PF. I (Cattell) sz | 150 | 20 ч 
the | 
i wherein a wide variety of background in- 
ave operated to produce 


at 

vill the distinction between Gf and Ge presently measure 

Will beco cult to draw in early childhood, tional opportunitte 

develop Ome gradually more clear-cut as capacity, as гер 

and wi еп proceeds through childhood ment of Gc will ten 

‚ The „0те most evident in adulthood. the development o! 

View; S neral theory also argues that in between the two w1 
empirical anal 


ing tj : 2 
9f vi = Gf-Gc distinction from the point 
= individual differences study, it 

st clear-cut in a sample of people 


Will rou 


om 


К ects aus over time their cumulative 
h n tend to independence. It follows 


fluences would h 


dera 
of backgroun 


f Gf 


yses, But when 
bly with respect to a variety 
d influences, then it is likely 


ble abilities. When educa- 
s are closely tied to 
resented in Gf, the develop 
d to be closely tied to 
and the distinction 
ll be difficult to draw in 


individuals 
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i any cases educational opportunities 
b ies сут on many factors other than 
the capacity represented by Gf. Hence the 
distinction between Gf and Gc will show up 
most clearly in analyses with samples 
wherein there is MM with respect 

round variables. 

B ime considerations led the authors to 
seek a sample of adults from various walks 
of life, a sample that was nearly as hetero- 
geneous as “adults-in-general,” if not truly 
representative of this population. - 

The tests listed in Table 1 were adminis- 
tered to a total of 297 persons: adults and 
older adolescents at the Stateville, Pontiac, 
and Dwight State Prisons in Illinois; the 
Illinois Soldiers and Sailors’ Children's 
School; Canon City Penitentiary in Colo- 
rado; and the unemployment rolls at the 
Colorado State Employment Office in 
Denver? Of the total, 215 were males. All 
subjects were volunteers, They were offered 
information about their performances as an 
inducement for volunteering and for doing 
their best. 

The age range was 14 to 61 years. How- 
ever, there was only one 14-year-old, two 
persons aged 61, one aged 56, one 55, and 
one 52, the bulk of the sample thus being 
between 16 and 51 years of age. The mean 


for age was 27.6 years, The standard devia- 
tion was 10.6 years, 


Test Administration Procedures 


The tests were administered in two ses- 


sions, each lasting approximately 2% hours, 
one in the morning and one in t 


noon. All subjects did the tests in 
order, although the break points 
the morning and afternoon sessions 
precisely the same for all groups. The same 
person administered the tests and the same 
stop watch was used throughout. Every 
effort was made to keep the conditions of 
administration the same for all groups. 
The subjects were very cooperative. As 


between 
were not 


‘For invaluable help in Securing this 
sample of subjects we particularly thank 
Julia Bates, staff psychologist at the Il- 
linois Soldiers’ and Sailors’ Children's 
Home; Arthur V. Hufiman, State of Illi- 
nois Criminologist; Wilson Meeks, Chair- 
man of the Classification Board at State- 
ville-Jolict Prisons; Stow E. Syman, 
Sociologist at Pontiac Prison; Bernard 
Robinson, Sociologist at Dwight Prison; 
George Levy, Senior Psychologist at Colo- 
rado State Penitentiary, and David J. Wil- 
son, Denver Department of Welfare. 
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nearly as it could be judged from “involved 
observations,” their motivation was gen- 
erally quite good in spite of the fact that 
the testing sessions were rather long. 


ANALYSES AND RESULTS 


The 31 primary factors listed in 
Table 1, along with age and years of 
education, were intercorrelated by the 
product-moment formula. The re- 
sults are presented in Table 2. With 
this size sample a correlation of 
about .11 or .12 is significantly dif- 
ferent from zero at the .05 level; an 
7 of .15 is similarly significant at the 
01 level. Using Hotelling’s (1940) 
test, a difference of about .15 be- 
tween two correlations would gener- 
ally be significant at the -01 level. 

Factoring proceeded with the first 
31 variables of Table l—that is, ex- 
cluding age and education, since 
these are nonbehavioral variables, 
logically distinct from the perform- 
ances represented by the measure- 
ment operations of the other varia- 
bles. The number of factors was 
estimated by use of what may be re- 
ferred to as the Kaiser-Dickman- 
Guttman (KDG) rationale (see 
Horn, 19652). This criterion sug- 
Eested nine factors. Principal factor 
calculations were repeated 25 times, 
each time replacing the diagonal ele- 
ment for a variable with that varia- 
ble’s communality estimate from the 
Previous calculations, Since the com- 
шипа ев determined on the 25th 
iteration differed from those deter- 
mined on the 24th iteration by only 
trivial amounts in the third and 
fourth decimal places, factoring was 
stopped at this point. The correlation 
matrix reproduced from the resulting 
nine factors very closely approxi- 
mated the origina] correlation matrix 
in all off-diagonal entries, there be- 
ing no residual larger than .10. The 
trace in the reduced space was 16, 
indicating that common variance was 


TABLE 2 
INTERCORRELATIONS OF ÅBILITY AND PERSONALITY FACTORS 


Communalities in diagonal 
Matrix number Symbol 
112 [3 + {5 6ут јао |10 [11 [121 13] 14] 15 16 | 17 | 18 19 | 20 
1 I 64 
2 SP 39| 39 
3 c 12/—10| 48| 
4 L 53| 36] 30| 57 
5 CFR 67| 39) 21) 53| 75 
6 R 42| 11| 21| 36) 30| 36 
7 DIT 40| 24) 13| 34) 46) 27| 35 
8 5 46| 31| 03| 25| 49) 19| 39| 53| 
9 Va 45) 22| 11) 36) 54| 32| 44) 47| 58| 
10 Ma 37) 28| 10) 35) 46] 20| 26| 26| 32| 38 
u GR 48| 18| 22| 49| 47| 42| 35| 21| 42| 34| 73 
12 Y 31| 09| 27| 40| 25| 39| 27| 10| 35| 20| 76) 78 
18 Mk 34| 05] 07) 18| 27| 38| 32| 30| 41| 12| 46| 45| 62 
14 Rs 45| 21| 20| 39| 45| 36) 29) 14| 26| 33| 60] 55| 30| 51 
15 EMS 21| 12| 06) 19| 17, 21| 15| 14| 13| 14| 32| 44| 20) 35| 32 
16 Fa 08| 05| 04| 12| 13| 12| 11|—04| 19| 24| 35| 42| 13| 25| 23| 57 
и Fi 24) 19|—01| 24| 25) 13| 23| 13| 21| 26) 36| 34| 21| 28| 24| 53 46) 
18 N 45) 20) 13| 44| 33| 32| 33) 17| 32| 35| 45| 44) 30| 35| 30| 27 36] 53 
19 Сз 42| 33| 05) 38| 55| 18) 32| 42) 42, 42) 33| 23) 16| 31) 21| 25 26| 28| 52 
20 с 40| 20| 16| 36| 47| 25) 37| 38| 55| 34| 35| 28| 33| зц 19| 11 23| 32) 48| 47 
21 Sc 22| 23| 01| 21| 22| 05) 19| 18| 22| 30| 17| 18| 03| 19| 25| 25 41| 39| 20| 24) 58 
22 Wt 06] 25| ooj 03| 19—07) 12) 14| 12| 17|-02/-03,-15| o1| 13| 19] 26| 13| 29| 11| 49 
23 P 37) 40| ooj 37| 36| 15| 19) 30| 30| 36) 23| 22 05) 22| a2| 16| z0| 52| 431 27| 47 
24 016 03-02) 11) 07—08] 03) 021—12—111—12) 09| 19—02| o4| 10) 16| o3|-o0| 011-02-02 
25 U21 1| 19—02) 13) 22| os| 18| 11) 23) 10| 18| 14| O4| 10 os| 22| 211 21| з1| 16 23 
26 U36 мү os| os| 14| 03) 00] 12) os| 09) O5| 20| 221 24| 21| 24| 21| o9| 18| 10| o2| o5 
27 ER 18| 16) 03) 10| 32| os| 15) 28) 23| 20) 12, 04) 22| 20| 10| os| 101 oel 29| 15 os 
28 U24 [71-05-04 —18|-09|—18/—19|—11|-20|-04|—37|-39|—33|24|—14|-15|—17|-14| -10|—14|--09 
29 H 97| 181—0) 07) os| 02| 07) 05) 07) 15) 20) 18| 141 ri| o9 17 15 21) 16! oi| 20 
30 Q 11—08] 22) 19) 13| 22) 19| 08) 17| os| 32| 42| 35| 23 20) 21| 18] 17) osl 20| o6 
31 ip —10| 01| 11| 08|—04|—07|—08|—19|—05|—02 00) 14—20 01 06) 22| oz|-os| 05/—02 oi 
32 Educ 27| 20 16) 38| 1| 20| 22) 03) 21| 08) 42) 53) 24| за] 27| 991 24| 49} 14) 24| 2% 
33 Age —12|-13| 14| 05/-29| 10| o4j—14| 03|—16| 22| 54| 20| 20| 29| 14| 05 20|—18| 05| oi 
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TABLE 3 
OBLIQUE (SIMPLE STRUCTURE) FACTORS 
melma) S| Ge | Gy | S | 7 | а | | ee | 
11 55 12 15 | —03 | —02 | —09 | —06 | —05 | —06 
2 |Sp 30 | —07 05 10 05 | —08 07 | —02 | —21 
s Ke 05 07 06 o7 | —05 | —14 12 06 62 
4r 47 12 08 04 03 | —11 12 | —07 18 
5 | CFR | 48| —08 $5 | —07 00 01 01 08 09 
6 R 26 30 06 | —04 04 | —03 | —12 | —02 13 
7 | DFT 01 09 42 03 00 00 | —01 05 03 
s |5 o4 | —04 48 08 | —07 | —30 | —07 04 | —09 
9 | Vz —06 07 50 02 | —05 08 | —04 | —04 01 
10 | Ma si | —02 05 10 05 15 | —09 04 02 
п |CMR| 30 50 06 | —13 07 29 02 | —01 02 
12 | Vv 08 69 | —06 00 00 26 10 | —02 09 
13 | Mk |-—15 48 15 | —08 00 04 | —28 08 | —07 
14 | Rs 29| 40| —10 | —06 00 19 | —04 12 05 
15 | EMS | —08 48 | —12 23 | = 05 04 08 | —02 
16 | Fa —10 35 | —06 07 01 60 16 02 | —10 
17 | Fi —03 25 02 25 01 42 | —04 | —03 | —10 
18 |N 20| 29 | —10 34 04 04 | —15 | —07 03 
19 | Cs 18 | —05 31 06 01 10 17 07 | —06 
20 | Cr —03 03 45 08 | —05 02 01 | —08 10 
21 | Sc —05 00 | —01 68 06 09 | —10 | —07 08 
22 | Wr |—13| —23 06 46 | —05 14 07 01 06 
23 | P 20 06 | —03 48 00 | —14 04 06 | —03 
24 |U16 | —04 18 | —08 | —05 15 | —07 48 | —04 08 
25 |U21 | —03| —08 10 10 23 12 24 15 04 
$ | ve j 0. 48 | -02 =) o| ж |.| — 3 
и датою =й | o|-4| 2|-E 
28 | U24 | —05| —26 | —07 06| —7 | —01 09 i 
29 |н -14| —06 | —12 07 82 00 | —01 © С 
aie (a) ж-ж e| lam | x x E- 
31 | Ip 04 04 | —02 | —04 | —15 20 64 | —12 07 
"Total 
Hyp. ct 16 15 20 24 23 18 19 26 
25 | 186 
Pot. ct 52| 48 64 77 74 58 61 84 81 | 66 
TABLE 4 
INTERCORRELATIONS AMONG SECOND-ORDER Facro: 
APPROXIMATION TO НїввАвситсдь Ga inen 
Symbol Gs Gv ER2 Gt PSI Gc PRM F С2(—) 
Gs 
Gv 37 
ER2 27 34 
Gf 39 34 17 
PSI 16 25 14 00 
Ge 10 36 | —01 16 33 
PRM 22 | 1] 12 00 10; | —ü 
F 22 09 06 20 06 | —12 04 
C2(—) 26 —02 19 —06 03 =12 09 —22 
Sums 1.99 | 1.62 | 1.28 | 1.20 | 1.07 | 0.69 | 0.45 | озз | 0.15 
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«айв of the total variance. The re- 
Me ie these analyses are pre- 
ADI? in a Table deposited with 
к achieve an approximation to a 
ple structure, Kaiser's (1958) 
ie procedure was employed and 
x was followed up with 13 graphic- 
xm. ee rotations. The Varimax 
| E o been deposited with 
ай ће oblique simple structure 
^ lon is given in Table 3. 
e S 3, the values in the row 
fms. Hyp. ct." are the actual 
ёа! te of loadings less than or 
"aad T 10 or greater than or equal 
"pot ` 0, and the values in the 
vance are the percentages of 
pla es in the +.10 hyper- 
T for the factors. 
€ intercorrelations for the ro- 
fated factors are given in Table 4. 


INTE 
NTERPRETATION AND DISCUSSION 
or RESULTS 


The С orrelations 


а Азов} All of the intercorrelations 
Exce e ability variables are positive. 
not рн are a —.10 correlation 
Speeq ignifieant) between Intellectual 
B а (Sp) and Carefulness (C) and 
low) Ъ correlation (significant but 
Mk) etween Mechanical Knowledge 
oso, 900 Writing Flexibility (Wi). 
ions e manifold for intercorrela- 
welle nong abilities is, of course, а 
established finding. 
ent aig. Шаг interest in the pres- 
Y are the correlations which 
2 
hag Mater 


bean supplementary to this article 


deposi А : 
оеша deposited with the American 
No. o9 ation Institute. Order Document 
or ев, ne ADI Auxiliary Publications 
mi, ogres: Otoduplication Service, Library 
mit in adog Washington, D. C. 20540. Re- 
fon bh es fe $1.25 for microfilm or $125 
br, hise Ries and make checks payable 
ATy of C Photoduplication Service, Li- 
Ongress, 


factors like L, I, and CMR—that is, 
hypothesized Gf markers—have with 
other factors. L tends to be speed- 
free. It was obtained from responses 
to items arranged in sets of five in 
each of which there was a continuum 
from very easy to very difficult 
problems, and the score is the aver- 
age level of difficulty of the prob- 
lems solved, not merely the number 
correct: A person can obtain a very 
high score by working through only 
a few sets of five and getting them 
all correct, whereas he will get a low 
score if he works many problems but 
gives incorrect answers for some of 
them. Both I and CMR seemingly 
involve some speed variance, since 
they were given under timed condi- 
tions. Yet the pattern of correlations 
between L and other variables is 
very similar to the pattern of corre- 
lations of CFR and I with other vari- 
ables, CFR and L, particularly, and 
I to a lesser degree, show relatively 
low correlations with the fluency fac- 
tors (the average r is .18) and with 
measures of knowledge, per se (e.g., 
Mk, EMS, and V, where the average 
r is .22, as well as formal education: 
average r = 21). The correlations of 
L with I and CFR are both .53 and 
these latter two correlate .67. Thus, 
overall, the suggestion is that factors 
like I, CFR, and L definitely do in- 
volve intelligence, as putatively rep- 
resented in other factors, but, rela- 
tive to these other factors, are 
somewhat more free of formal accul- 
turational influences. , А Р 
Many of the primaries said to in- 
dicate Ge—particularly V, EMS, 
and Mk—have positive correlations 
with both age and education, 
whereas some of the markers for 
Gf—notably CFR, Ma, and I—have 
negative correlations with age (—.29, 
—.16, and —12 being the correla- 
tions for the three primaries men- 
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ioned) though they correlate posi- 
ponis education (the values for 
the mentioned primaries being .19, 
.08, and .27). These findings thus 
again support the general hypothesis 
that the markers for Gf and Ge in- 
volve intelligence as this is generally 
rstood. 
othe fluency factors, Fi and Fa, 
sometimes treated as indicators of 
“creativity” (ef. Getzels & Jackson, 
1962; Guilford, 1962; Taylor, 1964), 
have rather low correlations with 
putative measures of intelligence (the 
average r for Fa and Fi being .18 
with I, L, and CFR and 25 with R, 
V, Rs, and EMS). Thus, if one were 
to favor the currently popular view- 
point that creativity and intelligence 
are largely independent, he would 
seemingly find support for his posi- 
tion in these results. But Fa and Fi 
show rather high correlations (aver- 
age .28) with simple Speed of Copy- 
ing and Writing Flexibility, neither 
of which is regarded as a good meas- 
ure of creativity (or intelligence) . 
The suggestion is that a substantial 
portion of the reliable variance of 
Fa and Fi does not reflect creativity 
at all, but indicates motor speed, the 
rather simple crystallized skills of 
cursive writing, and the temperament, 
factors associated with these meas- 
ures (Fa and Fi correlate .22 and 21 
respectively with UI-25, Exuber- 
ance). This interpretation is consist- 
ent with previous findings and prob- 
ably needs to be considered within 
the context of a carefully argued 
theory of creativity, such as that 
which Burt (1962) has recently out- 
lined, 


The Factors 


The “solidness” of the positive 
manifold among primary ability in- 
tercorrelations is made very evident 
in the first principal axis factor. This 
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contains not only variables repre- 
senting noegenetic processes in ab- 
stracting, reasoning, knowledge ac- 
quisition, etc., but also nonintellectual 
variables such as anxiety (U24), 
Radicalism (Qi), Exuberance (U21), 
Self-Sentiment Development (U36), 
etc. Thus the factor represents a con- 
glomerate of intellectual and nonin- 
tellectual functions. 

As concerns the theory being de- 
veloped in this study, the Varimax 
solution provides a confirmation of 
the general hypothesis, but a con- 
firmation which is much clarified by 
the improved simple structure 
achieved by visual rotation. The first, 
very broad (poor simple structure) 
Varimax factor involves spatial visu- 
alization processes to a considerable 
extent, but it contains too much vari- 
ance in nonvisual primaries (such 
as CMR, Rs, and L) to give really 
clear support for the hypothesis of 
a second-order general visualization 
dimension. With further shifts to the 
more nearly adequate simple struc- 
ture provided by visual rotation? 
however, the ambiguities inherent in 
the Varimax solution are largely 
erased, as is consistent with Thurs- 
tone’s (1947) general theory of sim- 
ple structure. The third factor in the 
visually-rotated ^ solution provides 
quite definite support for the hy- 
pothesis of a general visualization 
dimension spanning Vz, 8, Cf, Cs, and 


* For readers not familiar with the pro- 
cedures of visual rotation, it should be 
pointed that in the crucial meaning of the 
term “objectivity,” this method of rotation 
is just as objective as analytic methods of 
rotation. All the rotator sees when he ro- 
tates are points and his objective is to 
shift factors so that a maximum number of 
points will have essentially zero projec- 
tions on reference vectors drawn at right 
angles to corresponding factors. He is not 
aware of which variable corresponds to 
which point, just as a maximizing formula 
in analytie rotation is not thus "aware.' 
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а, зла dipping into measures of Сї 
жо where these involve figural 
tie ae And what is illustrated with 
те also applies to the other 
solution The Varimax adumbrates à 
indents that is much more clearly 
Шеш ed after . visual rotation. 
nid. although it is instructive to 
вова. and compare the two rotated 
boton S, in the interest of efficient 
tated m option; only the visually ro- 
wh actors will be discussed in 
" follows. 
бор ШЕ factor in the oblique solu- 
(CER) mins figural reasoning tasks 
; Sp) ; symbolic reasoning tasks (I, 
CMR and semantic reasoning tasks 
Lr RE Intellectual Level 
Varia p AR little or no speediness 
tern m is prominent in the pat- 
tual’ s primaries involving intellec- 
р, mee as best represented by 
keep s also present. The capacity to 
i Pain fundaments in immedi- 
“А areness is represented in the 
ant, i of Ma and P. Most impor- 
ys e high-loaded variables are 
adulig) ^ as reflect (in samples of 
Unas advanced schooling and other 
ае, intensive acculturation, 
clear] e variables which do most 
кы reflect these influences are in 
this qiberplane of the factor. Hence 
урос 07 clearly corresponds to the 
tion, "sized fluid intelligence func- 


lo. 


F 
Pore dor II provides substantial sup- 
тувар, hypothesis specifying а 
imarie ed function. Of the various 
е said : In this battery which may 
Considerah involve intelligence to any 
Very. on le — degree, practically 
fleet inte which could be said to re- 
Bromine, ee acculturation has а 
The ight loading in this dimension. 
Dreh Eh loadings in Verbal Com- 
Genera] В, Semantic Relations, 
ing, TW easoning, Formal Reason- 
э indicate that this is truly an 
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intelligenee dimension, involving no- 
egenetic processes, but one which is 
very much nurtured by accultura- 
tion. 

On first consideration it might 
seem anomalous that Factor II in- 
volves nonintellectual primaries such 
as UI-36 (Self-Sentiment), Qi (In- 
dependence) and UI-24 (Integra- 
tion). But if we consider the kinds of 
influences which are said to result in 
Ge and those which produce the kind 
of self-sentiment and personality-in- 
tegration indicated by the nonintel- 
lectual variables, it can be seen that 
the two overlap to a very great ex- 
tent. It is difficult to specify which is 
the chicken and which the egg in this 
relationship, but the fact of the rela- 
tionship itself is consistent with the 
general theory. In fact, this same kind 
of alignment of self-sentiment and 
Qı with a crystallized intelligence 
function was found in a previous 
study (Horn, 1961) aimed at demon- 
strating the mutually supportive in- 
terrelationship between ego func- 
tions, the self-sentiment and educated 


intellect. 
McFarlane Smith (1965) has re- 
d an excellent histori- 


cently provide 
cal and theoretical account of the 
general visualization function, but 


the present results in Factor III 
would appear to provide the clear- 
est verification yet presented of the 
thesis (dating back at least to El 
Koussy’s, 1935, important results) 
that visualization abilities stand 
somewhat apart from the analytic 
reasoning and crystallized abilities 
of Gf and Ge. It can be seen here 
that Gv involves a kind of fluency 
in imagining the way objects may 
change as they move in space, main- 
taining orientation with respect to 
objects in space, keeping configura- 
tions in mind, finding the gestalt 
among disparate parts in а visual 
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field, and maintaining a flexibility 
concerning other possible structurings 
of elements in space. It will be noted 
that virtually every task which may 
be said to involve spatial content has 
ing in this factor. 

t "n teo is & noteworthy differ- 
ence between the results predicted by 
Smith's analyses and those actually 
obtained here. Working mainly 
within the framework of research 
done in Britain—as summarized in 
Vernon's (1950) book, for example 
— Smith tended to see general visuali- 
zation as an even broader factor than 
is here revealed. Indeed, Gf and Gv 
were not clearly distinguished in his 
treatment, whereas the present re- 
sults show that although the two are 
cooperative in several important 
tasks (notably CFR), they are nev- 
ertheless linearly independent 
(though correlated) at a general 
level. It seems likely on the basis of 
these results, however, that particu- 
larly gifted mathematicians and 
Scientists, such as Newton, would of- 
ten be highly endowed with both 
Gf and Gv (not to mention Gc), as 
Smith, in effect, argues on the basis 
of his analyses of biographical and 
other data. 
In Factor IV such variables as the 
speed of copying and perceptual 
speed suggest that the factor could 
represent a motivational, state-like 
function of effortfulness in the im- 
mediate testing situation. Introspec- 
tively it makes good sense that there 
is such variance. In previous re- 
search (Cattell, 1957) the personal- 
ity factor UI-21, Exuberance, has 
been identified with such “competi- 
tiveness" in the immediate situation. 
Factor VI is defined primarily by 
the flueney primaries, and these have 
relatively little variance in the speed 
component of Factor IV. However, 
the fluency primaries do have some 
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variance in the Ge-reasoning factor. 
Also, in addition to sheer fluency, 
Factor VI brings in some variance on 
CMR. Possible interpretations of 
these findings have been suggested 
by Christensen and Guilford (1963), 
who indicated that a factor like VI 
can represent: (a) size of a reser- 
voir of “ideas” or (b) speed in “call- 
ing up” the ideas. Of course, these 
two functions could be closely re- 
lated. Regardless of which emphasis 
is taken in interpretation, the present 
findings indicate that the reservoir- 
glibness function of Factor VI is 
largely independent of Ge: Crystal- 
lized intelligence involves fluency, to 
be sure, as represented by the load- 
ings of Fa and Fi, but the fluency 
primaries, particularly, and V and 
CMR to a lesser extent, measure a 
kind of reservoir flow in the use of 
concepts which is distinct from the 
thinking processes of Ge, 

The remaining factors in Table 3 
are of less concern in this study, al- 
though they are rather intriguing. 
Factor IX, for example, suggests 
that the carefulness factor, measured 
among intellectual tests, stands vir- 
tually alone at the second order in 
this rather broad sampling of both 
intellectual and nonintellectual pri- 
maries. The pattern of loadings is 
virtually identical to the pattern 
of correlations which Carefulness 
showed with other primaries, The 
Suggestion is that Carefulness (in 
the space of these variables) is not a 
first-stratum factor, but one which 
belongs at the second order. 

Factor IX is similar in some re- 
Spects to a second-order superego 
factor found by Horn (1961) in a 
sample of ability and dynamic moti- 
vation variables. If carefulness is 
taken as an indication of superego 
development, the suggestion is that 
the superego is involved in perform- 


с е 
FLUID AND CRYSTALLIZED GENERAL INTELLIGENCE 


ances in С 
feel Len pip Reasoning, Intellec- 
t bios | and Verbal Comprehension. 
dash spe e oppose what may be slap- 
ifa 11055, as represented in Sp. 
inia ewe also relates to the aca- 
( Bener s of values expressed in Qu 
an aie also supports 
development ion in terms of superego 
"a d 
ш X can be identified as the 
1964) har imension Horn (1963, 
тау of as found among a wide ar- 
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ean whe r what most investigators 
esirabilite they use the term “social 
Яев опт, But it is not the only 
given аш faetor which might be 
dimens is name. At least two broad 
f lOns are required to account 


or t] 

or the ET 
Sone ee between ques- 
T items, tests, and factors. 


e 
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(01 = by the general anxiety 
-24) and the threctia di- 
> Ж, Cattell & Scheier, 
e traversi also involves variance 
oral-et) on and in measurements 
ет ethical values, as defined in 
f + factor, 
TA ree Ne is defined essentially 
fe ( ioe (a) Premsia, UI- 
~=), Three Ego, and (c) Mk 
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А е Assertiveness with 
pe hanieat he association with low 
Th Very a has likewise 
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to beer, ballet to burlesques 

Mk, on the other ty the кенә 
shows himself (and the Aim is ap- 
propriate here) to know the differ- 
ence between a U-bolt and a yoke, a 
timing gear and a crankshaft, a key- 
hole saw and a rip saw, etc. 

Faetor VIII is a specific loaded 
only by Early Risks. There would 
appear to be no need to discuss this 
factor at this time. 


The Correlations between 
Order F'actors 

It will be noted that a positive 
manifold exists among the first six 
factors and that positive correlations 
generally prevail for all of the sec- 
ond-order factors isolated in this 
study. Thus it would appear that the 
main general factors isolated in this 
are not completely independ- 


Second- 


study 

ent, and that a more general inte- 

grating principle operates among 
the theory out- 


them. According to 
er and expressed in 


lined in this pap 

more detail by Horn (1965b), the 
principle implied here is to be under- 
stood in he interaction in 


terms of t 
development between various intel- 
lectual and nonin 


tellectual traits of 
the person and in 


terms of the inter- 
action of both educative an 


d nutri- 
tive environmental conditions. А 

To note the positive manifold in 
Table 4 is not to say that a single 
factor will explain the covariance 
there represented. Hierarchical order 
is not by any means evident in the 
matrix. It stionable whether 
such an order 8 


should be found, both 
on general t 


i resent i 

eta As concerns the theoretical 
issues, consider the 
itive manifold includes the positive 
self-image dimension. It is reason- 
able that this should be correlated 
with Gf and Ge, surely, and it is not 
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nreasonable that it should be cor- 
тебей. with Gv and Gs, but it does 
not follow that it should fall into a 
hierarchical order defining a single 
unitary function. Hierarchical order 
is a considerably more stringent re- 
quirement than is positive manifold. 
It implies one and, more importantly, 
only one influence or attribute in a 
set of observations; positive mani- 
fold indicates one influence, but not 
necessarily only one. While it is rea- 
sonable to argue that positive self- 
image stems from some of the same 
influences as Ge, Gf, etc., it is not so 
reasonable to argue that these vari- 
ous functions involve only one thing 
in common. The implication is that 
a multifactor theory is called for 
among these dimensions at the third- 
order. 

However, the technical issue al- 
luded to above introduces difficulties 
which make it awkward, at least, and 
perhaps impossible, to adequately 
explore possible multifactor hypoth- 
eses at the third order in the present 
study. Thurstone (1947) presented 
convincing statistical and logical ar- 
guments for the contention that sim- 
ple structure factors are relatively 
invariant with univariate and multi- 
variate bias in the selection of sub- 
jects (and the results in the present 
study appear to be consistent with 
his deductions). But by these same 
arguments he suggested that the cor- 
relations between factors will not be 
invariant and in fact will reflect 
sampling bias. Thus in the present 
instance, where it seems evident that 
there is some sampling bias, the ta- 
ble of intercorrelations between sec- 
ond-order dimensions is possibly 
quite ambiguous as concerns the 

third-order factors. For example, the 
correlation of Gv with Gf (and also 
with Ge and Gs) probably represents, 
in part at least, sampling bias in 


drawing male and female subsam- 
les. 

К In general, then, although trends 
are indieated in Table 4, the differ- 
ences between these correlations 
should probably be interpreted very 
cautiously. 


Integration of Results 


The findings, overall, thus indicate 
that a major proportion of the vari- 
ance observed in intellectual per- 
formances can be understood in 
terms of six major functions: 

1. Fluid intelligence, representing 
processes of perceiving relations, 
educing correlates, maintaining span 
of immediate awareness in reasoning, 
and abstracting in both speeded and 
unspeeded tasks of a relatively cul- 
ture-fair kind but involving figural, 
symbolic, and semantic content. 

2. Crystallized intelligence, repre- 
senting similar processes of perceiv- 
ing relations, educing correlates, etc., 
in speeded and unspeeded tasks in- 
volving various kinds of content, but 
tasks requiring considerable pretrain- 
ing to acquire techniques represent- 
ing the accumulated wisdom of a cul- 
ture. 

3. General visualization, repre- 
Senting processes of imagining the 
way objects may change as they 
move in space, maintaining orienta- 
tion with respect to Objects in space, 
keeping configurations in mind, find- 
ing the gestalt among disparate 
parts in a visual field, and maintain- 
ing a flexibility concerning other pos- 
sible structurings of elements in 
space. 

4. General Speediness, an attribute 
Measured in simple writing and 
checking tasks requiring little in the 
way of complex relation-perceiving, 
but an attribute producing variance 


in the measure of most intellectual 
factors. 
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when the connective is a 
preposition than when it is a conjunc- 
tion and still smaller when the con- 
nective is a verb. Finally, it was 
assumed that the probability of select- 
ing a correct response for a given 
stimulus noun increases as the size of 
the class of appropriate response nouns 
decreases. From these assumptions the 
prediction followed that the presenta- 
tion of paired nouns in the context of 
verbal strings would produce faster 
learning when the connective was 8 
verb than when it was а preposition 
and that а preposition connective 
would produce faster learning than a 


conjunction. 


smaller 


ment was de- 
aluate this constraint 
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implications for paired-associate learn- 
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TABLE 1 


STUDY-TRIAL MATERIALS: CONJUNCTION 
AND VERB STRINGS 


List A 


Conjunction 
The BEE and a DOG. 


A LION and the 
HORSE. 

The srick or the 
COW. 


А TRAIN and a CAT. 


The ANT or a 
MOUSE. 

A HUNTER and the 
SHEEP. 

The run. or the 
DEER. 

The LIGHTNING or 
the GOAT. 

A MAN or the FOX. 

An AIRPLANE and a 
PIG. 


Verb 

The BEE chases a 
DOG. 

A LION scares the 
HORSE. 

The stick hurts the 
cow. 

А TRAIN kills a CAT. 

The ANT bites a 
MOUSE. 

A HUNTER calls the 
SHEEP. 

The FUR covers the 
DEER. 

The LIGHTNING sur- 
prises the GOAT. 

А MAN helps the FOX. 


An AIRPLANE wakes a 
PIG. 


List B 


A SPIDER and a 
POLE. 

A WORKER or the 
STAIRS, 

A KNIFE and the 
SHIP. 

The втокм or a 
CHAIR, 

The GLUE or the 
LADDER. 

A BIRD or the 


FENCE. 


The 10g and a SIGN. 


The NAIL and a 
ROPE. 

The PENCIL or a 
ROCK. 

The Broom and a 
WALL. 


А SPIDER climbs a 
POLE. 

A WORKER Paints the 
STAIRS. 

A KNIFE carves the 
SHIP. 

The storm breaks a 
CHAIR. 

The GLUE fixes the 
LADDER. 


BIRD uses the 
FENCE. 
The icp cracks a 
SIGN. 


The NAIL holds a 
ROPE. 

The PENCIL marks a 
ROCK. 


The BROOM Sweeps the 
WALL. 


EXPERIMENT I 


Method 


Materials and design. All subjects. (Ss) 
were given the same task: to learn a list of 
20 pairs of high-frequency nouns. Each pair 
was presented in the context of a meaning- 


T 


ful, syntactically structured verbal string. 
wo sets of verbal strings were constructed: 


in one of these the form class of the connec- 
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tives was conjunction; and, in the other, 
the form class was verb. With the exception 
of the connectives, the Strings comprising 
the two sets were identical. 

Each set of strings was subdivided into 
two mutually exclusive subsets of 10 strings 
each according to the rule that all response 
nouns within one subset should be semanti- 
cally appropriate for every string in the 
subset and semantically inappropriate for 
all strings in the other subset. Thus, the 10 
nouns within each subset were completely 
interchangeable. This rule, of course, only 
applies to the verb strings since conjunction 
connectives provide no semantic informa- 
tion about subsequent nouns. All of the 
response nouns in one subset (List A) were 
names of animate objects capable of auton- 
omous locomotion. All of the response nouns 
in the other subset were names of objects 
incapable of autonomous locomotion (List 
B). The verb strings were constructed first 
and the conjunction strings were formed by 
substituting a coordinate conjunction (and, 
or) for each of the verbs. A complete listing 
of the study-trial materials may be found 
in Table 1. á 

The size of the class of response nouns 
was manipulated by varying the conditions 
of response mode. Half of the Ss were asked 
to recall each response noun when presented 
with the appropriate stimulus noun and the 
other half were asked to recognize the appro- 
priate response noun in a list of ten of the 
response nouns used in the experiment. 'The 
recall condition was essentially similar to 
the learning task in which the differences 
between the efiects of conjunctions and 
verbs was initially detected. The constraint 
hypothesis asserts that under this condi- 
tion the size of the Tesponse class, given 
verb strings, is Substantially smaller than 
that for conjunction strings. In contrast, 
the recognition condition was designed to 
equate the sizes of the response classes for 
both the conjunction and verb strings. This 
was done by presenting Ss in both the con- 
Junction and verb groups with identical lists 
of 10 alternative response nouns for each 
stimulus noun presented. All 10 of the alter- 
native nouns were semantically appropriate 
for the string in which the given stimulus 
noun was originally presented. For example, 
whenever a stimulus noun that had been 
paired with a response noun from the first 
subset was presented, the list of response 
alternatives consisted of the entire first 
subset. Thus, the verb strings provided no 
more semantic information as to which of 
the 10 response nouns was correct than did 
the conjunction strings. For the conjunction 
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TABLE 2 
MEAN NUMBERS or Correct RESPONSES 
as A FuNcrION or Form CLASS AND 
Responss Море (EXPERIMENT I) 


Form Class 
Response Mode Conjunction Verb 
M SD M SD 
Recall 13.60 | 6.87 | 21.47 | 7.86 
Recognition | 22.47 8.99 | 27.00 | 5.44 


MSE (1,56) = 27.44. 
f each slide at a 4-second 
rate. The experimenter (E) read each string 
aloud as it appeared on the screen. After 
the last string had been shown, the Ss were 
told to open their response booklets to the 
first page, look at the stimulus noun, and 
write their response in the space provided. 
The stimulus nouns were presented at a 15- 
second rate to permit the Ss sufficient time 
to select their responses, write them in the 
booklet, and turn the page in preparation 
for the next stimulus. As each stimulus was 
projected, it was read aloud by E. 

At the end of the first test trial the re- 
sponse booklets were collected and those for 
Trial 2 were distributed. The procedure 
followed in Trial 1 was repeated to the com- 

Jetion of Trial 2 except that the strings and 
the stimulus nouns were presented in an 
order different from that used in Trial 1. 


Results 


the projection o 


asured in terms of 
correct responses 
during the two test trials. 
The mean numbers of correct responses 
for each 0 the four experimental 
conditions are presented in Table 2. 
These results were assessed by means 
of analysis which revealed 


of variance | l 
significant main effects associated with 


reed crucial for the constraint 
hypothesis was not significant (F<), 
nor were any of the remaining tests in 
the analysis of variance. Verb strings 
produced substantially more efficient 
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learning than did conjunction strings 
and the recognition mode produced 
more correct responses than did the 
recall mode, as expected. Even though 
the interaction of principal interest was 
not statistically significant, the dif- 
ference between verb and conjunction 
strings appeared smaller under condi- 
tions of recognition than under condi- 
tions of recall as predicted by the con- 
straint hypothesis. Given this, it was 
decided to replicate the experiment on 
Ss expected to be more homogeneous 
with respect to learning rate than those 
sampled in Experiment I. 


EXPERMENT II 


The second experiment to be re- 
ported consisted of an exact replication 
of Experiment I augmented by an 
additional condition of Response 
Mode. Furthermore, the experiment 
was performed on both a sixth-grade 
sample and a fifth-grade sample of 
children, 

The additional experimental groups 
were added to provide a corollary test 
of the constraint hypothesis. Suppose 
it were shown that under the original 
recognition condition, the difference 
between verb and conjunction strings 
could be significantly reduced. Such a 
result would be consistent with the 
constraint hypothesis but it would be 
open to an alternative interpretation, 
namely, that some characteristic pecul- 
iar to the recognition procedure itself 
prevents the detection of a difference, 
For example, it has been shown re. 
peatedly that performance measured 
in terms of recognition yields higher 
estimates of the amount learned or 
retained than when recall is required 
(Postman & Rau, 1957). Thus it might 
be that differences at high levels of 
performance are not comparable to 
differences at lower levels of per- 
formance. In order to clarify the 
interpretation of such results, a second 
recognition condition was added. In 
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this case, however, the condition was 
designed to permit semantic constraint 
to operate and to produce again а 
difference between the conjunction 
and verb conditions. » 

The second recognition condition 
(Recognition 10/5) differed from the 
first (Recognition 10/10) only in the 
composition of the lists of response 
alternatives offered for each stimulus 
noun. In the Recognition 10/10 condi- 
tion, all of the response alternatives 
Were appropriate for both the con- 
junction strings and for the verb 
Strings. In the 10/5 condition, how- 
ever, the 10 alternative response 
nouns were chosen so that only five 
would be semantically appropriate for 
the verb condition while all 10 were 
appropriate for the conjunction condi- 
tion. Thus, the prediction from the 
constraint hypothesis was that verb 
strings would produce more efficient 
learning in the 10/5 condition than 
the conjunction strings but not in the 
10/10 condition. 


Method 


Materials and design. The task and the 
materials for the second experiment were 
the same as those for the previous experi- 
ment. The only exception was the composi- 
tion of the response sheets for the 10/5 
group. In the 10/10 condition, the 10 alterna- 
tive responses for each Stimulus noun were 
drawn from the same subset to insure that 
all would be semantically appropriate. In 
the 10/5 condition, the 10 response nouns 
consisted of five from one subset, the subset 
containing the correct response, and five 
from the other, inappropriate subset. For 
example, if the correct response noun were 
à member of the animate subset, five of the 
response alternatives were the names 9 
animate objects and the other five wert 
names of inanimate objects. 'The members 
of the two subsets were randomly ordere Я 
on each response sheet such that no clea 

rouping was apparent. "m 
а 1t wis deine to divide this i e 
ment into two distinct 2 X 2 designs hers 
the simultaneous analysis of all six Oe 
mental conditions involved the confound) is 
of two factors: the mode of response (Beer 
nition versus Recall) and, within 
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т T 
aD conditions, the semantic appro- 
junction s of the alternative responses (Con- 
The à versus Verb strings). 
Betiment i design was a replication of Ex- 
catory fact with the addition of two classifi- 
Biecision. 079 used to increase experimental 
Form Ql; n all, five factors were involved: 
Verb st ass (Conjunction strings versus 
Versus uL ; Response Mode (Recall 
versus Si Frogn tian 10/10); Grades (Fifth 
versus d ); Reading Ability Levels (High 
versis T. edium-High versus Medium-Low 
ands Sta Trials (1 versus 2). Perform- 
test, the кз from а standardized reading 
available tanford Achievement Test, were 
used to i on all Ss and this information was 
ine S levels design. The ex- 
Was thet we of principal concern 
More eflici ле verb strings would produce 
Strings UE learning than the conjunction 
lon condi he recall but not in the recogni- 
З ition. 

Was ra one exception, the second design 
ode teres as the first. The Response 
ecogniti or involved a comparison of the 
Nition 10/5 e 10/10 condition with the Recog- 
Broups in Danes the experimental 
Were inyol пе Recognition 10/10 condition 
Second 5» ved in both analyses. For the 
With «n cnalysis the prediction consistent 
strir constraint hypothesis was that 

tli cos would produce faster learning 
i 5 conjunction strings in the Recogni- 
/5 condition but not in the Recogni- 
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clasg 
отете es. Within each grade, Ss were 
Scores BRE respect to reading achievement 
Ss each id then divided into quartiles of 12 
Sach qu: “Wo Ss selected at random from 
each of AME were randomly assigned to 
Broup wae SIX experimental groups. Each 
the tyo 18 assigned eight Ss from each of 
Bren grades. 
Experiment. The procedures followed in 
ч ir оп were the same as those fol- 
h inistered p. ment I. The task was ad- 
a Js нуи a study-trial test-trial method 
ib cond i tested in groups of eight. A 
Ww Wo sin ee rate was used during 
Маи Used da Y trials and a 15-second rate 
Sain read uring the two test trials. The E 
tbPeared тв strings aloud to Ss as they 
thls and =a the sereen during the study 
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TABLE 3 
Mean NUMBERS ОЕ CORRECT RESPONSES as 
A Funcrion or Form CLass, RESPONSE 
Mope, ANp GRADE (EXPERIMENT II) 


Form class 
1 
Grade Response mode Conjunction Verb 


м |SD, M |SD 


14.63/3.58/20.63/6.87 


5 | Recall 
Recognition 
10/10 19.004.14/30.88/2.53 
7.65/30.63.4.57 


Recognition 10/5/21.00 
6 | Recall 13.78.4.92 28.004.38 


Recognition 
10/10 24.12/5.89/:20.384.03 


Recognition 10/519. 75/5. 78/32. 25/3 .06 


MSE (1,31) = 1241. 


Results 

As in Experiment I, the dependent 
variable in the present experiment was 
the number of correct responses 
emitted. The results are presented in 


Table 3. 


Design 1. An analysis of variance 


was performed on the data produced 
by the Recall and Recognition 10/10 
groups. The main effect for levels of 
reading ability was not significant 
(F = 1.59, df = 3/31, p > 10), nor 
were any of the interactions with levels, 
nor was the main effect for grades 
(F = 443, df = 1/31,.p > 05). Тһе 
main effect for trials was significant 
(F = 258.82, df = 1/31, p < :0005), 
but none of the interactions of other 
factors with trials was significant. 
The factors of principal interest 
produced notably large effects. Re- 
sponse Mode differences favoring the 
Recognition 10/10 conditions were 
significant (F = 28.02, df = 1/31, 
p < .0005) and Form Class differences 
favoring the verb conditions were 
significant (F = 56.26, df = 1/31, 
< .0005). But the interaction of 


Response Mode X Form Class, pre- 
dicted by the constraint hypothesis, 
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was not significant (F < 1). A higher- 
order interaction, however, required a 
reinterpretation of these effects: The 
interaction of Grades X Response 
Mode X Form Class was significant 
(F = 8.91, df = 1/31, p < 01). The 
form of the interaction was such that 
the prediction derived from the con- 
straint hypothesis was confirmed by 
the performance of the sixth-grade Ss 
and contradicted by the performance 
of the fifth-grade Ss. That is to say, 
the predicted interaction of Response 
Mode X Form Class was significant 
in the sixth-grade samples (F = 6.52, 
df = 1/31, p < .025) but not in the 
fifth-grade samples (F = 2.78, df = 
1/31, p > .05). Furthermore, the 
direction of this second-order interac- 
tion in the sixth-grade was opposite 
that in the fifth-grade. In the sixth- 
grade groups, the verb strings produced 
more efficient learning than the con- 
Junction strings only for the Recall 
conditions and not for the Recognition 
10/10 conditions. In contrast, the 
difference between conjunction and 
verb strings for the fifth-grade Ss was 
larger under the Recognition 10/10 
condition than under the Recall 
condition. In sum, the constraint 
hypothesis was firmly supported by 
the results in the higher grade and just 
as firmly denied by the results pro- 
duced by the younger children. 

Design 2. In the analysis of variance 
performed on the data produced by 
the Recognition 10/10 and Recogni- 
tion 10/5 conditions, only two signifi- 
cant sources of variation emerged: 
Form Class (F = 53.55, df = 1/30, 
p < .0005); and Trials (F = 173.21, 
df = 1/30, p < .0005). Neither of the 
two interactions that might have 
provided support for the constraint 
hypothesis reached statistical signifi- 
cance. The interaction of Response 
Mode X Form Class was not signifi- 
cant (F = 1.19, df = 1/30, p > .05) 
nor was that of Grades X Response 


Mode X Form Class (F = 3.64, df = 
1/30, p > .05), although the latter was 
relatively substantial. The simple 
effects of the interaction of Response 
Mode X Form Class was significant 
for Grade 6 (F = 4.50, df = 1/30, 
р < .05) but not for Grade 5 (F < 1). 
The latter result is, of course, con- 
sistent with the results reported in 
connection with Design 1, but never- 
theless, provides only weak support 
(confined to the sixth-grade sample at 
that) for the constraint hypothesis. 

No prediction relevant to this out- 
come had been made in advance and 
no convincing explanation could be 
developed post hoc. Before making & 
serious attempt to interpret the strik- 
ing differences detected in Design ! 
between fifth- and sixth-grade children, 
it was decided to replicate the entire 
experiment on other samples of fifth- 
and sixth-grade children drawn from 
a comparable population. To ac- 
complish this purpose, Experiment II 
was performed. 


ExPERIMENT III 


Experiment III differed from Ex- 
periment II in only two ways. First, 
the classification factor of reading 
ability levels was not used since it 
had proved of little consequence 1P 
the previous experiment. Second, the 
96 Ss were drawn from a differen 
school in another district because the 
population of fifth- and sixth-grade 
children in the district initially chosen 
had been virtually exhausted by the 
first two experiments. The school 
sampled in the present experiment was 
selected specifically because of its 
close comparability with the school 
sampled in Experiment II with respec 
to the tested intelligence and the 5008" 
class membership of its students: 
Apart from these two deviations 
Experiment III was an exact replica 
tion of Experiment II. 
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Results 


hn again the dependent variable 
emitte т йек сы of correct responses 
ria and 2 
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Vite 1. A four-way analysis of 
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but Шан hool used in Experiment II 
radical]; result for Response Mode is 
planais, different. We have no ex- 
lon аа the fact that the recogni- 
cient Ie itions produced no more effi- 
resent arning than the recall in the 
h experiment. 
once KS effect for Form Class was 
й pm significant, (F = 41.20, 
ассошц оа р < .0005) and by itself 
botal vari for more than 2596 of the 
wo свай observed. Clearly, the 
Produce erent kinds of connectives 
differ reliably and substantially 
a rates of learning. 
са] Ка? remaining effect of statisti- 
REY, cance was Trials (F = 230.63, 
interact, 6, p < .0005); none of the 
Of varia ns were significant sources 
Principal . ce. The two interactions of 
obtained interest in view of the results 
Mode x in Experiment II, Response 
€sponsge Form Class, and Grades X 
accounte > Mode X Form Class, each 
Variatio for less than 1% of the total 
than р ma produced F ratios smaller 
тези j he conclusion from these 
esis р. 15 Clear: The constraint hypoth- 
Perfo x SIDON whatever in the 
esi, of Ss in Experiment Ш. 
of tee 2. The results of the analysis 
Conte; Rce performed the data 
ained ; on 
Consisten, m Design 2 were entirely 
t with those reported in 


TABLE 4 
Mean NUMBERS or Correct RESPONSES AS 
A Funcrion or Form Crass, RESPONSE 
Море, AND GRADE (EXPERIMENT III) 


Form class 
Grade! Response mode Conjunction Verb 
M |SD| М |SD 
5 | Recall 16.88/6.83/25. 25/8. 29 
Recognition 
1 19. 63/7 .05/26 . 63/6. 37 


0/10 
Recognition 10/5/20.50/6.97/28. 75/6.07 


6 | Recall 


Recognition 
1 16.25/7.21/31.25.4.30 


0/10 
Recognition 10/5,17.50/6.61 26.75/0.52 


MSE (1,56) = 22.57. 


connection with Experiment II. The 
only significant effects were those as- 
sociated with: Form Class (F = 37.57, 
df = 1/56, p < 0005); and Trials 
(F = 176.51, df = 1/56, p < .0005). 
The main effects of Grades and Re- 
sponse Mode each produced an F < 1, 
as did the interaction of Grades X 
Response Mode X Form Class. The 
interaction of Response Mode X Form 
Class also accounted for an incon- 
sequential portion of the total varia- 
tion, 1% (F = 1.18, df = 1/56, p > 
.05). Again, Experiment Ш affords no 
evidential support for the constraint 


hypothesis. 
Discussion 

The hypothesis of semantic con- 
straint is not sufficient to explain the 
facilitation of learning produced by 
verb connectives. In only one of the 
times that the basic experimental 
design was repeated did the results 
lend support to the constraint explana- 
tion. Even when the size of the class of 
appropriate responses Was equated for 
conjunction and verb conditions, verb 
strings produced substantially faster 
learning. Thus, the phenomenon re- 


mains to be explained. 
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fore dismissing the notion of 
E constraint, one of its hidden 
assumptions should be exposed and 
examined. For clarity, the learning 
process may be divided into an input 
phase, corresponding to the study trial 
in the present experiment, and an 
output phase, corresponding to the test 
trial. Semantic constraint, as presently 
formulated, should have its effect on 
performance during the output phase, 
that is, when S selects a response from 
among those available to him. It is at 
the time of recall that the verb might 
semantieally limit the number of 
alternatives from which a response 
must be chosen. This may be an er- 
roneous assumption. The principal 
locus of the effect of verb strings on 
learning may be the input phase rather 
than the output phase, just as varia- 
tions in the accuracy of recall can be 
localized in the phase of original learn- 
ing rather than in the conditions of 
recall (Olton, 1965). If this analysis 
were verified empirically, it would 
suggest that explanations of the facili- 
tory effect of verb strings should refer 
to conditions which affect the acquisi- 
tion process during the phase of initial 
input. 

Two other features of the present 
results deserve special emphasis. The 
first concerns the power of verb strings 
to produce high levels of performance, 
In each of the experiments performed, 
the presentation of the paired as. 
sociates in verb strings was sufficient 
to produce performance under condi- 
tions of recall that was as efficient as 
that produced by conjunction Strings 
under conditions of recognition. Praeti- 
cally speaking, verb connectives can 
reduce the difficulty of a recall task to 
that of a normal recognition task. 


Jn. AND STEVE LYNCH 


This may rightly be construed as a 
demonstration of the commonplace 
that the manner in which learning 
materials are presented to the learner, 
or the manner in which he operates on 
them, markedly affects learning effi- 
ciency. 

Finally, the results of repeated 
replication of the same basic design 
in the present effort demonstrate 
another commonplace, namely, the too 
infrequently considered caution that 2 
single experiment is not sufficient to 
establish a phenomenon nor to decide 
an issue. Had the present design been 
performed only once, on the sixth- 
grade sample in Experiment Il, I 
conclusion implied about the constrain 
hypothesis would have been diametri 
cally different from that which is nov 
strongly indicated. 


REFERENCES m 
JENSEN, A. R., & Rouwer, W. D., Jn. [o 
effect of verbal mediation on the learnt a 
and retention of paired-associates by i 
tarded adults. American um of Men 
Deficiency, 1963, 68, 80-84. (a 
Pes ig A. R, & Ronwer, W. D. Ја, 
Verbal mediation in paired-associate i» 
serial learning. Journal of Verbal bear 
and Verbal Behavior, 1963, 1, 346-352. ( A 
JENSEN, А. R., & Ronwer, W. D., JR- prr 
tactical mediation of serial and paire 
associate learning as a function of 28° 
Child Development, 1965, 36, 601-008. 
Otron, К. M. The effect of a mnemonic Ural 
the retention of paired-associate Ver? 
material. Unpublished doctoral dine id 
tion, University of California, Berkeley» 
1965. мА 
Роѕтмалм, L., & Rau, L. Retention as à e 
tion of the method of measurement. Pail 
versity of California Publicotions in 
chology, 1957, 8, 217-270. vin 
Ron wae. W. D., Jn. The verbal facilitation 
of paired-associate learning. Unpublis ү. 
doctoral dissertation, University of 
fornia, Berkeley, 1964. 


(Received October 11, 1965) 


{ 


Journal o; 
of Educational P: s 
1966, Vol 57, No. 5, Mag m 


SIZE-DISCRIMINATION LE 


ARNING AS A FUNCTION 


OF MOTIVATION-HYGIENE ORIENTATION 
IN ADOLESCENTS! 


H. CARL HAYWOOD anp 


THEODORE D. WACHS 


George Peabody College 


Adolescent Ss (N 


were selected for high-motivation or high 
ere given a 
9 out of 10 correct trials on 2 suc- 


the Choice-Motivator Seale, and w 
tion problem to а dual criterion of 
cessive days. The proc 
of average intelligence 
vation-oriented (MO) Ss learned 
hygiene-oriented (HO) Ss on soi 
groups, personality differences did 
in original acquisition, but MO Ss 
In the low-IQ groups, MO 8s requir 
HO Ss failed to learn t 


me 


Mls ти of intellectual develop- 
Promine unctioning, one of the most 
een ee sources of knowledge has 
ing in pot a of discrimination learn- 
ally ret dren, both normal and men- 
Siderin, "es (Stevenson, 1963). Con- 

еу he amount of work done wit 
hat во i learning, it is surprising 
pon — emphasis has been placed 
rocess he relation of this learnms 
Survey P personality variables. À 
the тине the literature from 1940 to 
esent revealed only nine studies 
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— 80) with below-average intelligence (IQ = 


edure was rep! 
(IQ = 105.5). At each inte! 
more efficiently than di 


he discriminatio 
еа 


70) 
-hygiene orientation using 
visual size-discrimina- 


eated for 80 similarly selected Ss 
Iligence level, moti- 
id IQ-matched 
of the scores. In the average-IQ 
not predict learning differences 
relearned faster than did HO 85. 
ed fewer total trials. More low-IQ 
n in the 200 trials given. 


ting the relationship of per- 
variables to performance in 
discrimination-learning tasks. Cor- 
responding to their prominence in other 
areas of learning, studies of manifest 
anxiety were most frequent (e.g., 
Farber & Spence, 1953; Hilgard, Jones, 
& Kaplan, 1951; Spence & Farber, 
1954; Stevenson & Iscoe, 1956). 
Studies have also related need for 
achievement and diserimination learn- 
ing (e.g. Estes, Miller, & Curtin, 
1962; Miller & Estes, 1961) as well as 
psychopathic tendencies, as measured 
by the MMPI, and performance in a 
conditioned-discrimination problem 
(Warren & Grant, 1955). 

Most relevant to the present study 
is the work of Terrell and his associates, 
who investigated the effects of varied 
motivational factors upon discrimina- 
tion learning (Terrell, 1958; Terrell, 
Durkin, & Wiesley, 1959; Terrell & 
Kennedy, 1957). According to Terrell 
(1958), middle-class children learned 
more efficiently under intrinsic motiva- 
tion than under extrinsic motivation; 
that is, these children learned for 
Jearning’s sake or for the achievement 
of learning rather than for material 
incentives. Lower-class children 


investiga 
sonality 
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learned more ve ме эма 
incentives. Although Terrell 2 
these differences to personality vari- 
ables, his discussion was post hoe in 
that there were no independent meas- 
ures of personality characteristics. 
Recent developments in personality 
research have indicated that Terrell 
may be quite correct in considering 
differences in children’s performances 
on a discrimination-learning task as 
being due to the type of motivational 
system (intrinsic versus extrinsic) that 
characterizes the individual child. 
The present research investigates the 
relevance of the motivation-hygiene 
concept formulated by Herzberg, 
Mausner, and Snyderman (1959) and 
extended by Herzberg and Hamlin 
(1961) to performance of children in a 
size-discrimination task. The motiva- 
tion-hygiene concept asserts that an 
individual can make two types of 
adjustment toward his environment. 
The first, a hygiene-oriented adjust- 
Ment, is extrinsic in nature: The 
hygiene-oriented (HO) individual is 
motivated by extrinsic environmental 
factors, for example, a need for secu- 
rity, a need for comfort, and the 
avoidance of stress and difficult tasks. 
The contrasting form of adjustment, 
the motivator-oriented adjustment, is 
intrinsic in nature: The motivator- 
oriented individual (MO) is motivated 
by intrinsic self-actualizing factors, 
for example, the desire for achieve- 
ment, responsibility, and interest in 
work, a high sense of function pleasure 
(Herzberg & Hamlin, 1961), and a 
tendency to approach tension-inducing 
situations. The approach tendency of 
the MO individual and the avoidance 
tendency of the HO individual have 
been confirmed by Haywood and 
Dobbs (1964). . 
If basic learning tasks are indeed 
sensitive to individual differences in 
personality characteristics, it would be 
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expected that the MO individual and 
the HO individual, because of their 
differing motivational systems, would 
perform differently on a discrimination- 
learning task. The HO individual, 
being less concerned with achievement 
and being less interested in the task 
than in the environment, would be less 
sensitive to cues in the task and mii 
willing to give up trying since he woul 
be getting little satisfaction from the 
task. The MO individual, being con- 
cerned with achievement in a b 
and being interested in the task itse 
because of his sense of function pleas- 
ure, would be more sensitive to cues E 
the task and more willing to keep Ог 
trying to solve the problem. Hence, 
it would be expected that the ar 
crimination learning of MO individua 
would be more efficient than would t! e 
discrimination learning of HO n 
dividuals. Р 
The present study has the secondary 
aim of exploring some of the environ 
mental variables which might be 
expected to maximize the effects © 
individual differences in motivatio T 
hygiene orientation. Working и d 
schizophrenic inpatients, Sandvo 
(1962) has shown that motivator orien- 
tation can be increased by adding tO 
the effort required to perform a tas : 
For Sandvold’s unimproved sehiz 
phrenics, the effort involved in the tas : 
had to be related to a goal in order fo 
the full benefits of an MO orientation 
to be realized. In the present study i 
degree of effort and relevance of enD 
to task solution, as well as intelligen® 
level, were systematically investigate 
to determine whether they interact 
with the motivation-hygiene orienta" 
tion in influencing speed of learning а 
а discrimination problem. While it W 
expected that MO individuals wou 
show faster learning than would . a 
individuals, no definite рше 
were made for effort, relevance, ОГ 
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Possibilities of a differential effect of 


Personali : lect о 
i at different levels of intelli- 


METHOD 
Subjects 


Sub; 
enar jests (80 were 160 adolescents at the 
School for сараки School, a residential 
in this sch epenident children. All children 
intelligence. are given the Lorge-Thorndike 
entrance e tests as part of the school's 
group wa Procedure. A high-intelligence 
dren ha з constituted from all those chil- 
the Poen an IQ of 95 or above on 
Eroup SES Tors kts a low-intelligence 
aving int constituted of those children 
he school Q of 85 or below. All children at 
who met these criteria were given 


Cra: TABLE 1 
Groups мо Ss | HOSs 
ba Intelligence 
IQ 40 40 
L CA 105.5 | 105.5 
Y Intelligence 14-8 14-1 
IQ 40 40 
CA 76.2 75.0 


the Cho; 
belo hoice-Motivator Scale to be described 


wW, 

Vator) ыу б scoring in either the top (moti- 
ifference bottom (hygiene) quartiles of 
total ofina Were used as Ss, making & 
children groups of Ss: high-intelligence 
Winterset high-intelligence НО children, 
ligence 59106 MO children, and low-in- 
iqUated i HO children. The groups were 
Окса] apo’ intelligence level and chrono- 
Bg rien as shown in Table 1. The Ss in 
One s were then randomly assigned to 
Cort taoa? experimental conditions: high 
Maz low elie high effort-task irrele- 
10° irreleya ort-task relevant; low effort- 
Ss ont. In each condition there were 

and 10 HO Ss. 


4p 
Par 
atus and Measuring Insiruments 


te 


ea The motivati : Á 
Ун l wtivation-hygiene orientation of 
ісе руг a8 measured by means of the 


1962), which Ator Scale (Hamlin & Nemo, 
consists of 20 paired vocational 
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choices. The S is instructed to indicate 
which of the pair he would rather be and 
then give the reason for his choice. The 
scoring is based upon systematic manifest- 
content analysis of the response given, and 
not the choice. The answers are classified 
into motivator responses (five categories), 
hygiene responses (six categories) or into 
ten other categories which are not used in 
the final scoring (Hamlin & Nemo, 1962). 
A difference score, obtained by subtracting 
the number of hygiene responses from the 
number of motivator responses, was used 
to classify Ss. 
The learning task was а size-discrimina- 
tion problem with form and position irrele- 
vant. The correct size, either large or small, 
was counterbalanced for each group. Stimuli 
were provided by two Inline Readout pro- 
jectors (X 10N83). Each projector contained 
four different stimulus forms in large and 
small sizes. These were combined to make 
stimuli, the stimuli within each 
different in both form and size. 
the type of stepping switch used, 
two of the pairs were randomly chosen to be 
repeated at the end of the series, making a 
cycle of ons of stimulus pairs 
peated itself. 

lus pairs in the cycle was 


The correct choice was either the larger or 
ding upon which 


smaller of the pair depen 


was designate! 

menter. The projectors were mo I 
jnches apart on the back of a 12 X 28 inch 
wooden panel which was itself mounted on 
a wooden stand. Windows were cut in the 
face panel so the projector faces could be 
flush with the face panel. Two 12 X 20 inch 


boards and a 12 28 inch board were placed 
i he panel to form an 


at the sides and top of t 
i hole apparatus was 


Between the windows 
was а small yellow 
h was lit when S made a correct 
ertrial interval Акыт: con- 
5 seconds, with the yellow light re- 
ped on for 3 of those 5 seconds, if S made 
a correct choice. The complete operation 


was electrically programmed. * | 
effort manipulation consisted 


The task- ffi 
of having each S press upon ^ pedal with 
his feet while being tested on the discrimina- 
tion problem. The pedal consisted of two 
boards, 9 inches square, which were painted 
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flat black and hinged at one end. The top of 
the pedal was covered by a thin rubber 
sheet. The amount of pressure needed to 
depress the pedal was dependent upon the 
number of springs inserted between the two 
boards. Up to six springs could be inserted 
with each spring being calibrated at 5 
pounds pressure. Depressing the pedal 
closed a switch located between the boards. 
This switch had connections leading back 
into the program board so that, depending 
upon the experimental condition S was in, 
depressing the pedal either started the dis- 
crimination apparatus (Task Relevant) or 
stopped a ringing bell located in the experi- 
mental chamber (Task Irrelevant). 


Procedure 


The discrimination apparatus and pedal 
were placed on a low table in a corner of the 
room facing a window which was covered by 
a black curtain. The program board was in 
an opposite corner behind a Plywood screen 
which was painted flat black. The S was 
asked to sit before the discrimination ap- 
paratus and the following instructions were 
given: 

While you're here, we're 

a couple of games. The first one is to see 

how strong you are. You press down on 

this pedal with both feet till you hear a 

click and hold it till I tell you to let up. 

(For each Successive depression of the 

pedal, E said: Good, now try it again.) 

Following these instructions, the experi- 
menter ascertained the maximum pedal 
weight S could depress. For high-effort 
Ss, 5 pounds was subtracted from this maxi- 
mum down to a minimum pressure of 15 
pounds. For those Ss who could depress 30 
pounds with little trouble the pedal was left 
at its maximum pressure, For low-effort 
Ss, the pedal was kept at 5 pounds pressure. 
Upon completion. of this procedure, the 
following instructions were rend to all Ss: 

This is a game in which I want you to 
choose between two shapes which will 
appear in these windows and press the 
button under the one you choose. You are 
to try to guess the right shape. If you press 
the button under the right shape, this 
yellow light will go on showing that you 
are right. If you press the button under 
the wrong shape, the light will not go on. 
Press the button only once, let off, and 
don’t press it again until some more 
shapes appear in the windows. The aim of 
this game is to see how quickly you can 
learn to choose the shape that makes the 
light go on. Any questions? 


going to play 
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For Ss in the task-relevant condition, the 
following instructions were then read: 

In order to play this game, however, 
You must keep your feet pressed down on 
this pedal. As long as this pedal is down 
the machine will stay on and you can try 
to light the light. If you let up on this 
pedal, the machine will Eo off and will 
stay off till you press the pedal down 
again. OK, you press down on the pedal 
and I'll go next door and start the game. 
For Ss in the task-irrelevant condition, 

the following instructions were read: 

While you're playing the game, how- 
ever, I want you to keep your feet pressed 
down on this pedal. If you let your feet 
up, a bell will start ringing to let you know 
that your feet are up and it will not stop 
till you press down again on the pedal. 
So keep your feet on the pedal while you 
try to light the light. You press down on 
the pedal now and I'll go next door and 
Start the game. 

Upon completion of these instructions, 
E left the chamber and went behind the 
plywood screen where he started the ap- 
paratus. As can be seen from this procedure, 
task relevance was defined in terms of 
whether the S's effort was necessary or not 
necessary to continue the task. р 

Each S was given up to 100 massed trials 
per session to a eriterion of 9 out of 10 cor- 
rect, on the same problem, for two successive 
sessions. Each S was given only one session 
per day. Until criterion was achieved, the 
correct size remained constant. Immediately 
upon the S's achieving criterion, the correct 
Size was reversed and S was run to а cri- 
terion of 9 out of 10 correct in the reversal 
Procedure. If S had not Solved either the 
original problem or the reversal by the end 
of 200 trials, he was terminated. At the be- 
ginning of each new Session, each S was 
asked if he remembered how to play the 
game. If he did not, the original instructions 
Were repeated to him. 


Analysis 


Four measures were obtained for each 
S: (a) the number of trials required tO 
reach the acquisition criterion of 9 out О 
10 correct for 2 successive days on the same 
Problem (total acquisition score; those 
Ss who did not learn the problem yen 
given a score of 200); (b) the number of triats 
required to reach the first criterion doring 
acquisition (original acquisition score; ony 
those Ss who could learn were included a 
this analysis); (c) the number of tria 
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requi 
Pr ne nee the second criterion, on 
the day i ER em, when S was brought back 
tion cuter reaching the original acquisi- 
those Ss m n (task carryover score; only 
this кке ш learn were included in 
criterion o)? (d) number of trials to reach 

he da ia тауап. 

intelligens obtained from the two levels of 
that each à ү analyzed separately, so 
evel of ince for a homogeneous 
except revertar oe For each analysis, 
sis of varin S ,12 X 2 X 2 factorial analy- 
Reversal d E (Lindquist, 1953) was used. 
analysis ere a were analyzed in a two-way 
Ovariance design (Winer, 1962). 


RESULTS 


L А 
ow-Intelligence Group 


сада бхресіей, MO 85 took signifi- 
acquisiti, trials to reach the total 
position criterion than did HO Ss 
Ana e df = 1/72, p < .05). 
Scores P pe of the total acquisition 
and tage 9 Wed no effect, of task effort 
ч for Е elevance upon rate of acquisi- 
group, n 5з in the low-intelligence 
With th or did these variables interact 
Ysis of y oec raa: dimensions. Anal- 
and tash 16 original acquisition scores 
Significant У over scores showed no 
5 and 3 differenees between MO 
ental ¢ Ss under any of the experi- 
9f the i Further inspection 
Ses were à indicated that these analy- 
Proporti € because of the high 
Teach qi A of НО Ss who failed to 
tateq a her criterion, which necessi- 
latter er being dropped from the 
Ss fae sis. While 12.5 % of the HO 
trials ed to learn the problem in 200 
failed only 2% of the MO subjects 
07. ? learn over this period (p < 
Be 
Vanes үе task effort and task rele- 
of ше to show any effect in any 
dimensi acquisition analyses, these 
s Versal qj were collapsed and the 
Way a ata were analyzed by a two- 
1965, ?nalysis of covariance (Winer, 
tota ец covariance adjustments for 
“quisition scores. This analysis 
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TABLE 2 
Means AND STANDARD DEVIATIONS ON ALL 
DEPENDENT VARIABLES FOR Low- 
INTELLIGENCE GROUPS 


MO Ss | HO Ss 


Measures 


Original acquisition: (Tri- 


als to criterion) 47.58 | 59.90 


(34.30)| (42.80) 
Task carryover score: (Tri- 


als to criterion) 19.26 | 23.97 


(21.38)| (31.18) 
Total acquistion: (Trials 


to criterion) 70.10 | 98.10 
(47.23)| (61.55) 

Reversal: (Trials to crite- 
rion) 51.97 | 32.50 
(55.34) (36.91) 


Note.—Original acquisition plus Task 
carryover score will not equal Total ac- 
quisition since the latter included all Ss, 
while only those Ss who could learn the 
problem were included in the two former 
scores. Parenthetical entries are standard 
deviations. 


showed that, contrary to expectation, 
HO Ss solved the reversal problem in 
significantly fewer trials than did MO 
Ss (F = 5.85, df = 1/66, p < .025). 
The various scores for the low-intelli- 
gence group are summarized in Table 2. 


High-Intelligence Group 
Analysis of the total acquisition score 
for Ss in the high-intelligence group 
showed no effects of the environmental 
manipulations and no significant dif- 
ferences between MO and HO Ss. 
Since there were no differences in the 
proportion of MO and HO Ss who 
failed to learn (5% and 2% respec- 
concluded that the 


tively), ib was cont. x 
analysis of the original acquisition 


and task-carryover data would not be 
invalidated by the necessity of drop- 
ping Ss in only one group, as happened 
in the low-IQ analysis. Analysis of the 
original acquisition score showed no 

ort or task relevance and no 


effect of eff 
differences between MO and HO 65; 
however, on the task-carryover score, 
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TABLE 3 
MEANS AND STANDARD DEVIATIONS ON ALL 
DEPENDENT VARIABLES FOR HIGH- 
INTELLIGENCE GROUPS 


Measures MO Ss | HO Ss 


Original acquisition: (Tri- " _ 
ls to criterion) 44.75 | 45.10 
ain (33.53)| (36.31) 
Task carryover score: (Tri- 
als to criterion) 11.30 | 19.65 


(5.46)| (21.93) 


Total acquisition score: 
(Trials to criterion) 63.32 | 64.82 
(47.80)| (47.25) 

Reversal: (Trials to crite- 
rion) 26.21 | 26.94 


(34.51) (36.84) 


Note.—Original acquisition plus Task 
carryover score will not equal Total ac- 
quisition since the latter included all Ss, 
while only those Ss who could learn the 
problem were included in the two former 


Scores. Parenthetical entries are standard 
deviations. 


MO Ss, when brought back to the 
same problem on the next day after 
reaching original criterion, took signifi- 
cantly fewer trials to reach criterion 
again than did the HO Ss under the 
same conditions (F = 5.26, df = 1/72, 
р < .025). Again, because effort and 
relevance showed no effects, these 
dimensions were collapsed and the 
reversal data were analyzed in a two- 
way analysis of covariance (Winer, 
1962) with covariance adjustments for 
total acquisition scores. No significant 
differences between MO and HO Ss on 
reversal were found (F = .008, df = 
1/72). The various scores for the high- 
inteligence group are summarized in 
Table 3. 


Discussion 


Individual differences in motiva- 
tional system were found to have a 
significant effect on some of the meas- 
ures of discrimination learning, both 
in high-intelligence and low-intelli- 
gence groups. This was so in spite of 


the fact that the task, size discrimina- 
tion, while adequate for the low- 
intelligence group, proved to be too 
easy for the high-intelligence group 50 
that a ceiling effect was encountered in 
this group. The environmental vari- 
ables (task effort and task relevance) 
did not interact with the motivation- 
hygiene dimension. The failure to find 
interactions with the environmental 
variables offers at least three possibili- 
ties of explanation: (a) differences in 
the populations sampled in the present 
study and in Sandvold's (1962) study; 
(b) the possibility that the effect of 
the environmental variables is not а 
real effect at all; and (c) the possibility 
that the present operations for effort 
and relevance may not be sufficiently 
strong to elicit measurable effects. It 
is still possible that under conditions 
of higher effort, these variables might 
show an interactive effect. 

While there were no differences 
between MO and HO Ss on the high- 
intelligence level under conditions O 
total acquisition, original acquisition; 
and reversal, the MO Ss showed cleat 
superiority under the task-carryoVer 
condition. Since the mean scores of the 
MO and HO groups on the measures 
revealing nonsignificant differences 
were in the predicted direction, it 18 
possible that with a more difficult task 
the MO Ss would also show superior 
acquisition. An alternative explanatio 
is that with Ss of average and above- 
average intelligence, the motivation® 
system would not affect original lear? 
ing; however, since MO Ss are more 
interested in learning and more task- 
oriented (by definition), they would be 
expected to pay more attention 9 
relevant cues and thus to be able tO 
recall or resume a previously attempte 
task. This notion could be tested Ld 
use of a task more suitable to t! 
intelligence level under study, 1 
Which both acquisition and rec. 
would be scored. If this reasoning > 
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ien there should be no differences 
bus MO and HO Ss on original 
Pes but the MO Ss should be 
prese = on recall (as they were in the 
rise рч study) ; however, if our lack of 
ceilin lon differences is due to a 
sul ti dun in the present task, it 
aeg € expected that MO Ss would 
recall yaad in both acquisition and 
these ; he present data cannot answer 
^h questions. 
HO Ss superior learning of MO over 
acquisiti the low-IQ sample in total 
the lon scores supports the utility 
in stud motivation-hygiene dimension 
especially = basic learning processes, 
n the шош mentally retarded Ss. 
acquisiti ow-IQ group, the original 
in the lon and task-carryover scores 
Sidereq present study were not con- 
Proporti valid because of the high 
at all Th of HO Ss who failed to learn 
about th his result raises a question 
обу, 28 mature of the affects of the 
ell be lon-hygiene process. It may 
tained asked whether the results ob- 
omana due to the superior per- 
Performe, of the MO Ss, the very poor 
ination at of the HO Ss, or a com- 
Proporti of both. The fact that a high 
t bun 4 of the HO Ss were unable 
port ^. he task at all gives minimal 
S o the second hypothesis. 
Survey pregant study, as an exploratory 
Vationa] the effects of differing moti- 
Orman, Systems upon learning per- 
For oit has yielded promising data. 
Intelligon low-intelligence and high- 
ndings ce groups, the significant 
tion-hy Were as expected from motiva- 
Suggested io theory. The findings also 
thi 5 at least two new studies in 
replicate 3 studies designed both to 
Explore ai present findings and to 
е mot; e possible means by which 
their нц ational systems studied have 
Present Pes upon behavior. The 
Бону А suggest that it will be 
Process з, ; in the study of the learning 
8, to pay more attention to those 


Sup 
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a variables that can be 

psye jometrically indexed, thus making 

experimental capital of the fact of 

individual differences, rather than 

wishing they would go away. 
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GROUP STRUCTURE AND PRODUCTIVITY IN 
PROBLEM-SOLVING: 


BRYCE B. HUDGINS ax» LOUIS M. SMITH 
Washington University 


The 4 experiments were conducted i 
from middle and lower social classes 
problems. General support was found for 3 
hata ope to problems are not better than the indepe 

y the most able member of the group if he is perceived t 
able. (b) In arithmetic, when the able group membe: 
as most able, the group will do better than the 
m There will be a shift in the group's percep 

igh-ability member if the group's scores are not 


dividual's. One of the most interesting and unan 
sk where the groups had lower scores 


occurred with the social studies ta: 


n which elementary school children 


solved arithmetic or social studies 
hypotheses: (a) Group 
ndent solutions 
o be most 
r is not perceived 
high-ability member. 
tion of a low-status 
better than the in- 
ticipated findings 


than the high-ability member working alone. 


———— 


wage research reported in this paper 
Some of bare is and elaboration of 
udgin he findings of an earlier study 
Which ES. 1960). In that investigation, 
of aug а to examine transfer 
tom gro netic problem-solving, skills 
Was се to individual situations, it 
Worked = that subjects who had 
lems mo Е groups did not solve prob- 
individual effectively in a subsequent 
нт setting although the groups 

е eie E produced more cor- 
Viduals инс {һап comparable indi- 
Servation orking alone. Informal ob- 
groups supr ed that within the 
Success į jects met with differential 
Problem n having their solutions to 
Parent] S accepted, this success ар- 
ascribed depending upon the status 
of the to them by the other members 
data Жш Later analysis of the 
Ing toe that the problem-solv- 
Correlated of the groups were highly 
ita " with the performance of the 

€ control subject. Two of the 


Centr 

a . 

Seargh qivestions of the present re- 
thi eveloped from an analysis of 


S ear 

a study; these were hypoth- 
ne n 

with ни reported herein was con- 

Den, 996 Unites to the terms of a contract 

fap tment ed States Office of Education, 
* (Надо; of Health, Education, and Wel- 
Sins & Smith, 1962). 
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eses relating the productivity of the 
group to its most able member, and 
to the accuracy with which his ability 
is perceived by other members of the 
group. An additional hypothesis was 
formulated as the  experimenters 
sought implications of the first two 
for the functioning of small problem- 


solving groups. 
THEORY AND HYPOTHESES 


Theoretically, an understanding of 


these questions required a number of 
concepts; these are listed and defined 


briefly below: 
subjects (H-Ab Ss): 


High-ability 
group members who are demonstrably 
superior on abilities relevant to the 


task. . 
Average-ability subjects (Av-Ab 
Ss): group members who are demon- 
strably less able than H-Ab S but who 
have some ability relevant to the task. 
arithmetic 


Task: school problems, ari 
and social studies, graded in difficulty. 
Status: perceived ability relevant to 


the task. me 
The reasoning which links our con- 
follows: assume that 


cepts proceeds as 
an ideal group of three persons 1s com- 


osed of one H-Ab S and two Av-Ab 
Ss, Also assume that the task of the 
group is à series of problems perfectly 
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scaled for difficulty, so that the ms 
item is of minimum difficulty, eac 

subsequent item increases d 
in difficulty to the last, which is o 

maximum difficulty. In such a group, 
each item which can be passed by Av- 
Ab S can also be passed by H-Ab S, 
but the converse is not true. Suppose 
that the group is given a series of 
problems to solve, scaled as described 
above, and designated a, b, c, --- f 

- n. Problem f represents the upper 
limit of Ау-А S's ability, and m, 
which is theoretically at least several 
steps advanced in difficulty, the upper 
limit of ability for H-Ab S. Then, any 
one of the three group members can 
correctly solve problems a through f, 
but only H-Ab S can function effec- 
tively beyond that point. 

Assume that environmental condi- 
tions demand concensus about a solu- 
tion before it can be accepted. An 
unusual and a quite difficult role is im- 
posed upon Av-Ab $. It is essentially 
a role which involves allowing H-Ab S 
to make his maximum contribution, 
but at the same time imposing upon 


him the obligation to demonstrate the 


adequacy of his solutions, Conversely, 
H-Ab S is in the difficult situation of 
having more of a contribution to make 
and with individuals who do not have 
ability to judge his contribution. 

An intriguing question concerns the 
recognition of the most capable mem- 
ber of the group (H-Ab S). Drawing 
upon the earlier illustration, when 
Problem g and subsequent problems 
are reached, Av-Ab Ss have no way of 
proposing workable solutions, yet H- 
Ab S has not reached the limits of his 
ability. When he suggests a solution, 
what conditions are likely to influence 
Av-Ab Ss’ acceptance of it? First, ac- 
ceptance of his solutions should come 
more readily if H-Ab S had high status 
prior to the formation of the group; 
that is, if Av-Ab S has had earlier 
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acquaintance with him as a ne 
student. Second, if H-Ab S is not e 
tially perceived accurately, vae ee 
tions should meet with diminis jn 
resistance, perhaps merely Lagoon 
is the only member who can prop ly 
reasonable solutions to peri! 
difficult problems. The probabi ito 
that Av-Ab S will accept a solution 
a problem in the g to n range Ис © 
seem to be controlled by the ыш 
procedure he employs in evalua ple 
solutions. Since Av-Ab S is not & di- 
to evaluate H-Ab S's БОД sige їв 
rectly, that is, on their merits, he 
likely to resort to a Lec ie ае 
Seems reasonable to anticipate lu- 
Av-Ab S is more likely to accept 4^ 8 
tions offered by H-Ab S when ally 
latter is perceived as one who p.m 
knows the right answer to probleme. j 
When H-Ab S is clearly percer up; 
as the most able member of his o d 
one would expect the productivity о? 
the group to approximate the pro the 
tivity of H-Ab 8 closely, on H- 
grounds that Av-Ab S recognizes = 
Ab S's superiority on the task dim nis 
sion. For each of the four experi 
reported in this paper, the first ple 
pothesis reads: When the most ge 
member of a small problem-solvi^ 
group is also a high-status membe” ү, 
the group, the productivity of. ity 
group is a function of the productiv i- 
of the high-ability member. ed 
tionally, this means that no stati m- 
cally significant difference in the ae" 
ber of problems solved will pot S 
tween the situation where So a 
works alone and that in which he } 
member of a small group. га 
If the high-ability member 0. ү 
group is of low status, however, ost 
is, if he is not perceived as the pon 
able member of the group, the pilit 
tionship between the high-a f the 
member’s productivity and that o ж 
group described in Hypothesis 1 
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n i 
M 1f it does hold, there should 
ability ge in the status of the high- 
accord goo such that he is now 
mates слер status by his group 
the ex, hus, Hypotheses 2 and 3 of 
tion Шеп bear а contingent re- 
The lp to each other. 
^ uo d hypothesis states: When 
Beet: able member of a small prob- 
member "t group is also a low-status 
o 9 the group, the productivity 
pro db is not a function of the 
ber ШОУНУ of the high-ability mem- 
o Kus data ed this is interpreted 
cantly n lat the group will be signifi- 
ability м productive than the high- 
¿ect the ut low-status member. In ef- 
indicate а wil demand that he 
thereby Ee solution processes and 
ing hügex a checking and correct- 
son jig „Рге@1ойор in Hypothesis 3 is 
Secong je upon the outcome of the 
Sis state Ypothesis, The third hypothe- 
that is A If Hypothesis 2 is retained; 
than dg the group is more productive 
9f the ^ most able member, the status 
Change Jin-ability member will not 
that is, d Hypothesis 2 is rejected ; 
between Significant difference results 
Producti group productivity and the 
Status wi of the able member, his 
Derations change from low to high. 
that, in nally the hypothesis means 
ow ‘spn е first case, H-Ab Ss of 
tions tum Will receive fewer nomina- 
Most н, their group mates as the 
th pete member of the group 
S wi -status high-ability mem- 
s In кы from their group 
T e Where e second case, that is, the 
18 original Hypothesis 2 is rejected, 
пелет i) low-status high-ability 
Productive ould be nominated as most 
leh € about as frequently as 


1-5 
Rated” 


The atus members are so nomi- 

‚ The t 

"eq E and the hypotheses de- 
m it are built on the assump- 


tion that the tasks employed are 
perfectly scaled for difficulty. In Ex- 
periments 1A and 1B, arithmetic 
problems are used because, of all the 
curricular areas of the elementary 
classroom, arithmetic tasks probably 
come closest to meeting this assump- 
tion about scaling. But it seems also 
of interest and importance to know 
how significant this assumption is to 
the theory. If the theory accounts for 
group functioning when the assump- 
tion of scaling is fairly met, is it also 
useful when one can argue on & priori 
grounds that this assumption is vio- 
lated, or at least met less well than in 
the case of arithmetic problems? 
Since curricular areas such as the 
social studies and language arts prob- 
ably depart more from this assumption 
than do arithmetic and perhaps 
science, it seems to be important to 
know something about the generality 
of the theory to other areas. For that 
reason, Experiments 2A and 2B were 
conducted with social studies tasks. 
Similarly, educational theory often 
has been extended uncritically to a 
variety of environmental contexts. 
Our interests in the broader urban 
community suggested that replications 
between lower socioeconomic schools 
and middle socioeconomic schools 
would be fruitful. Consequently we 
selected children from these two areas 


of the community. 


METHOD 


Sample 
the hypotheses stated 


roups containing three 

omposed. Each group 

thmetic or a social 
s given el Ц > 

Boot с with the instruction to solve 

lems as possible. 


sroups were d 
group: neighborhoods and 24 from 


middle-class c 
schools in lower-class neighborhoods. Half 
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S in each socioeconomic school 
eod m ce on arithmetic, and half on 
social studies. The grade levels of the groups 
ranged from fifth to eighth grades. Š 

All the groups contained one Н-АЬ 
and two Av-Ab Ss. Within each socio- 
economic area, 12 of the groups contained 
H-Ab Ss who were perceived as such by 
their peers. The H-Ab Ss in the other 12 
groups were not recognized by their group 
mates at the outset as superior in task- 
relevant ability. 


Constituting the Groups 


All three members of each group were 
drawn from the same classroom, То con- 
stitute the experimental groups, pupils in 
participating classes were administered the 
Arithmetic Reasoning section of the Stan- 
ford Achievement Test, Form J, or the 
Sequential Tests of Educational Progress, 
Social Studies, Part One, Form 34. A variety 
of sociometric data was collected at the same 
time. For the experiment attention centered 
on nominations by classmates of the "three 
or four pupils in this class who are best in 
arithmetie (social Studies)" and the "three 
or four pupils in this class who are poorest in 
arithmetic (social Studies)" 

For cach class Separately, means and 
standard deviations Were computed for the 
test scores and the sociometrie nominations 
on ability. First, in order to qualify as an 
H-Ab Sa pupil had to sco 
test one standard deviatio 
the mean for his class, 
mean were 14,75 
were 3.25 
or more to be identifie 
Av-Ab Ss in each class wi 
approximately at the mean, In this illus- 
trative case, a pupil with а score of 14 or 15 
would be so designated, 

Second, whether a pupil identifie 
ability test score as suitable woul 
cluded in the study depended upon meeting 
an additional criterion. The Sociometric rat- 
ings of potential H-Ab Ss were first in- 
spected, If such a pupil received sufficient 
nominations as a good student to place him 
one standard deviation above the mean for 
his class, he was identified as a high-status 
H-Ab S. H-Ab Ss who were rated below the 
mean were deseribed as low status. Pupils 
who met the first criterion (test score) but 
not the second were dropped from the sam- 
le. P 
| To be included in a group with a high- 
status H-Ab S, the Av-Ab Ss must have 
nominated H-Ab § as one of the most able 


if the class 
rd deviation 
score of 18 
d as H-Ab S. The 
ere those who scored 


d by his 
d be in- 
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pupils in the group. For inclusion ш de 
with a low-status high-ability member, os 
Av-Ab Ss must not have given him & Pee 
tive nomination. In a few instances dio 
nominated him negatively, although 
criterion was not met rigorously. 
Procedures 


its 

Each small group was removed паш ja 
regular classroom to a different loca uld e 
the building where the experiment peo" 
conducted without interruptions. З ЫЬ, 
were provided with experimental mai k to- 
and were told that they were to ай у. 
gether to try to solve the problems cor their 
The only restriction imposed чре "ro an- 
method of operation was that a ane upon 
swer to each problem was to be agree pro- 
by all members of the group eo th of 
ceeding to the next problem. One oF б ses- 
the authors was present at each ШОН n 
sion. Each session was tape Pee 
addition, the experimenter kept ru abers- 
notes on the behavior of the group mem with 
The procedure followed was the вале ше 
all the groups, regardless of task or str 
With regard to H-Ab S's status. к b- 

The taak was a set of 30 arithmetic Prt. 
lems (Stanford Achievement Test, E 
metic Reasoning, Form K) reproduce о 
mimeograph to remove the impressio jtems 
Standardized test? or 35 social studies STE 
taken from Part One, Form 3B of the 
Social Studies, 


No specifie time limit was set. 
groups obviou 


Sino 
e 
isly require longer to comple 
Such a task than individuals do, it oduc" 
tended to allow the groups as much p erage 
tive time as they required. On the av 
Stoups consumed about an hour. " 
close of the experimental session, ea¢ wh 
ject was asked to “name the one Perm right 
helped the group the most to get ШЕ wish 
answer, You may name yourself if Y vately» 
to do so,” Responses were written Phe ex^ 
and were collected immediately by that 2° 
Perimenter, Subjects were assured amet 
one else would learn whom they had n 


REsuLTS 


rately 
The results are reported separi the 
or each of the four replications 2 
asic design. The replications a! 
b 


ane 
eled Experiments 1A, 1B, 2A; 
2B. 


e 


; missio? 
" Reproduction was done with per New 


from World Book Company, Yonkers, 
York, 


Group STRUCTURE AND PROBLEM SOLVING 


Experin 
eriment 1A  (Arithmetic—Hi 
0cioeconomic Status) A 


H 
tivi eg “ыш 1 states that the produc- 
group in w a small problem-solving 
is of hig} which the most able member 
е р | status will be no greater than 
member. um of that individual 
to test t] іх groups were constituted 
Score of i hypothesis. The pretest 
Vidual pr -Ab S represents his indi- 
duetivit roductivity. The group’s pro- 
o рй, is represented by the number 
һе tien en solved cooperatively by 
саве thee Pupils in the group. In this 
Status НАР, pretest score for high- 
average E b Ss was 19.83, and the 
test of t} group score equal to 19.67. A 
be Nos | significance of the difference 
&tof1 pa COMME means resulted in 
5 df, this y For a two-tailed test with 
nificance ms does not approach sig- 
he e. he hypothesis is retained. 
n the m hypothesis predicts that 
Up will | -Ab S is of low status, the 
ividual be more productive than the 
ated, the oo S. As we have indi- 
essential] asis for this prediction lies 
in tte Ge Taat that Av-Ab Ss, 
about 4} likely to be more dubious 
er wh he solutions offered by à mem- 
able, S they do not recognize as 
Checks = impose more rigorous 
high-stats H-Ab S than they would а 
each of the member. In this case, if 
SPective ss members plays out his re- 
igher role, the result should be а 
duce poor? than H-Ab would pro- 
e b»; Ашы The pretest scores of 
Ment ha atus H-Ab Ss in this experi- 
н арш of 19.17. The mean 
ems E was 20.67, just 1.50 prob- 
erence d on the average. The dif- 
Pin meee mee individual and 
of ә. neans yields a Ё of 1.97. At 
the .05 in required for significance at 
test, Е with 5 df for a one-tailed 
Slight though the obtained £ is 
the a than the required value, 
ors’ disposition is to retain 


Whe 
gro 
ind 
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the hypothesis; that is, to conclude 
for this replication that the groups 
were more productive than the indi- 
vidual H-Ab Ss. This interpretation 
is supported by the results of the next 
hypothesis, and by the later diseussion 
bearing upon the relationship between 
the status of H-Ab S and the role he 
plays in the problem-solving group. 
As explained earlier, all subjects 
were asked to indicate, immediately 
following the experimental session, 
which of the members contributed 
most to the group’s success. These 
nominations were used as indications 
of the postexperimental status of H-Ab 
S. One prediction made by Hypothesis 
3, and the one which seems appropriate 
es that if the groups are more 
e low-status H-Ab 
atus should remain 


here, stat 
productive than th 
Ss, a difference in sti 

between the high- and low-status H- 
Ab Ss after the experiment. The high- 
status H-Ab Ss received mean nomina- 
tions of 1.83 (of a possible 2; 
self-nominations by H-Ab S were 
ignored) as most responsible for their 
group's success. The low-status H-Ab 
Ss received only .67 nominations on 
the average. This difference is highly 
significant t = 580, df = 11 for a 
one-tailed test; P e ЇЙ.) 

Experiment 1B (Arithmetic—Low 
Socioeconomic Status) 

For the lower socioeconomic status 
groups, the results of the first hypoth- 
esis are the same as those reported in 
Experiment 1A. The mean score for 
H-Ab Ss is 16; for the groups it is 17. 
This mean difference yields a non- 
significant t of 1.37. The groups do not 
differ in productivity from the able 
members, and the hypothesis is re- 
tained. 

In the test of the second hypothesis, 
although the groups solve on the aver- 

blems more than the low 


age 1.17 pro 
status H-Ab Ss alone (17.67 problems 
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Г for the groups compared with 
[neo the H-Ab Ss), this difference is 
not significant. The £ value is 1.80. 
The prediction for Hypothesis 3 thus 
is that the postexperimental status of 
low-status H-Ab Ss will not be differ- 
ent from that of the high-status mem- 
bers. A mean difference of .17 exists 
between the two sets of scores. This 
difference favors the high-status mem- 
bers. They receive 1.67 nominations 
compared with 1.50 for the low-status 
H-Ab Ss. However, the difference is 
not significant. The t of .42 suggests 
that the obtained mean difference be- 


tween the two groups is not a reliable 
one. 


Experiment 2A (Social 


Studies—High 
Socioeconomic Status) 


In the higher socioeconomic status 
groups, the high-status H-Ab Ss in- 
dependently solved an average of 29.33 
social studies problems. For the 
groups, the mean score was 27.17. This 
mean difference is not Significant, Tt 
yields a t of 2.38 with 5 df. 'The hy- 
pothesis is retained. 

In testing Hypothesis 2, when the 
Same comparison is made between 
low-status H-Ab Ss and their groups, 
the result is similar to that reported 
above. The H-Ab Ss solve 29.17 prob- 
lems on the average; their groups only 
27.33. A t of 1.81, however, based upon 
the mean difference, is not statistically 
significant. The hypothesis is rejected 
since the groups are not more produc- 
tive than H-Ab Ss. 

The prediction in Hypothesis 3 is 
that the postexperimental nominations 
to high- and to low-status H-Ab Ss 
will not differ. The high-status mem- 
bers receive 1.5 nominations on the 
average, compared with 1.4 for the 

low-status members. This difference 
yields a nonsignificant Ё of .21, and 
the hypothesis is retained. 
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" Q "ox —L0Ww 
Experiment 2B (Social Studies—L 
Socioeconomic Status) 


; us 

The pretest scores of the Poem 
H-Ab Ss have a mean of 22. 4 The 
mean for their groups is Y i ee 
difference, resulting in a £ of . e ss fe 
significant. The first hypothesis 

in d. re 
en the high-ability members "E 
of low status, their group $ pee em 
19.50. For the high-ability mem)? 
the mean is 21.17. The alne г" de 
this comparison is 1.39, which thesis, 
significant. The second рае 
that the groups produce more high- 
solutions than the low-status 
ability members, is not Cael 

The high-status members from 
1.33 nominations on the үс 
their group mates as most rs low- 
for the success of the group. 0 nomi- 
status subjects receive only 5 ig cases 
nations. The prediction in thi nomi- 
Hypothesis 3, is that the ше the 
nations given to the high- 4 ot be 
low-status H-Ab Ss should p other: 
significantly different from ben is 0 
For this comparison, a Ё of um d 
tained, which, for a two-tal P. е, 
with 11 df, is not a significant cance 
A t of 2.20 is required for signi 


г hesi* 
at the .05 level. The third hypo 
is retained, 


Discussion 


ro" 

The results of the experiments Bory 
vide reasonable support darle first 
and hypotheses elucidated in t tatio” 
part of this report. The interpre i re 
is perhaps clearer and more nme ; 
hensible with respect to the ari udie? 
experiments than the social v the 
experiments. Two major aspects 180087 
Studies seem to require some to d? 
Sion and explanation. One aede j 
With the change in status of low ela” 
H-Ab Ss. The other concerns the "° g- 


о] stu 
tive unproduetivity of the social 
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ie 
cr ae Let us consider them in 
"esu concern regarding status 
bue or lie with the sociometric 
Wr oru HE ШЕ perceived abil- 
Paper (Hu. to the task. In an earlier 
1962) w. a Smith, & Johnson, 
Sessed eli ound that the measures pos- 
ЧЧ iability. A later investigation 
that th s & Loftis, 1966) indicated 
Wang eee See low- 
eract, ma р pupil, does not in- 
less Gen the teacher significantly 
Pupils) visible children (high-status 
action oe does he initiate less inter- 
Visible = teacher than does the 
classroom, in the ongoing regular 
dins ies when a small group of 
gaged d school children are en- 
they mu cooperative problem-solving, 
each ms exchange information with 
response 4 about the probable correct 
these e to each problem. Through 
Sumab] м зані, the children pre- 
Of the y ecome aware to some extent 
group А. epa they and their 
Operation. T make to the total group 
er hat is not to say that any 
е role iy can explicitly identify 
simpl contributions of the others, 
Make ak that, if asked, they can 
Toles доме judgment about the 
&roup, yed by the members of the 


It 
status I PDA be expected that 8 high- 
tribute igh-ability member would con- 
9f the a lion’s share to the functioning 
his im d He is already known to 
he Кее as the most able of 
lion to ue makes a central contri- 
Should ғ e group problem-solving, 
Corded i follow that he will be po 
Contribution. riate recognition for his 
ut : 
r “he. of the low-status H-Ab S? 
Te clo ing suggests that he may be 
Sely examined by his group 
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mates before they are willing to accept 
his solutions, and by this means the 
productivity of the group may be en- 
hanced. However, since one or both of 
the Av-Ab Ss must play a significant 
role to bring about this enhancement, 
it is not so clear that the role of the 
H-Ab S in this situation is the central 
one. Hypothetically when the group's 
productivity exceeds his, the postex- 
perimental status of the low-status 
individual will not rival that of his 
high-status counterpart. If, however, 
the low-status member plays a more 
important role in furnishing solutions 
to problems, the score of the group 
should be similar to his own, and his 
postexperimental status should now be 
about the same as that of a high-status 
member. That is, there should be con- 
gruence between the roles played by 
the high-ability members and their 
group mates’ acknowledgments of 


those roles. 
To specify 
contribution to 
experimental status 0: 
member as carefully as possible the 
protocols made during the group ses- 
Such protocols 

were av f the total 48 
groups. For this analysis, the concept 
“eontribution to the group” was de- 
fined operationally as “offering the 
solution to à problem which was ac- 
cepted by the group as the correct 
n` Two roles were identified: 
that of the H-Ab 8, and that of the 
de the larger number 

of contributions, and who is subse- 
to as the rival Av- 


this relationship between 
the group and post- 
f the high-ability 


Ab S. : . : 
In light of the previous discussion, 
the first comparison of interest is that 
between the high-status and the low- 
status H-Ab Ss. Recall that in Experi- 
ment 1A, which was an examination of 
oblem-solving with higher 
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TABLE 1 
н RR /AL 
COMPARISONS OF CONTRIBUTIONS TO GROUPS BY Нісн-Авплтү SunjiECTS AND RIV 
TOS AVERAGE-ABILITY SUBJECTS 


H-Ab 5 Rival Av-Ab 5 t 
Experiment (М No. of (М no. of SDa 
solutions) solutions) 
1A: Arithmetic—high socioeco- 
nomic schools ө 44** 
High status 14.00 4.67 2.71 EE 
Low status 10.83 10.50 3.01 А 
1B: Arithmetic—low socioeconomic 
schools 30** 
High status* 17.25 7.00 3.02 pm 
Low status 12.25 5.00 2.18 3.35 
2A: Social studies—high socioeco- 
nomic schools _ 
High status 18.33 10.33 3.68 2 17 
Low status? 16.00 13.00 3.51 
2B: Social studies—low socioeco- 
nomic schools » 
High status 13.33 12.83 1.84 „27 
Low status 13.00 13.83 3.77 .22 э 
a Based on analysis of 4 groups. 
> Based on analysis of 5 groups. 
* P < .05 (1-tailed tests). 
** p < .01 (I-tailed tests). 
Socioeconomic stat : ioni к а in the 
Status groups, the Significant difference occurre 
groups were no 


J more productive than 
the high-status members, they ex- 


ceeded the low-status members in pro- 
ductivity, and a significant postexperi- 
mental difference resulted between the 
two types of H-Ab Ss, as predicted, 
When the contributions of the H-Ab 
Ss are examined, it is revealed that the 
high-status subjects contributed an 
average of 14 solutions to their groups, 
Low-status subjects contributed 10.83, 
on the average, to their groups. This 
mean difference is not significant, With 
a standard error of 2.82, the t value is 
only 1.12. Thus, contribution to the 
group, defined as number of accepted 
solutions offered, does not seem to ex- 
plain the residual status differences 
between high- and low-status H-Ab 
Ss. 


In the three other experiments, no 


o 
Postexperimental status of the ies 
H-Ab S groups, Analysis of the рг0 {0 
Cols with respect, to contributions y- 
group problem-solving by the eis 
ability members tends to confirm, ree 
lack of difference. None of the сон i 
comparisons is statistically signifi S 
For each experiment, the Av- bur 
who makes the more frequent cont”? the 
tion can be considered a rival econ” 
H-Ab S. Presumably the closer his the 
tributions rival those of H-Ab 5; iv 
fewer nominations H-Ab S will Phe re^ 
as most helpful to the group. ae те“ 
Sults of these comparisons are 
Ported in Table 1, Notice that mave 
results of Experiment 1A, the ctlY 
Av-Ab Ss contribute almost y ue 
35 many solutions as the low-§ pot! 
H-Ab Ss do. This is also true for nic? 
Comparisons in Experiment 2B, 


he 
Ee 


GROUP STRUCTURE AND PROBLEM SOLVING 


is а ie 
pde * een of Social Studies 
groups. ower socioeconomic status 
acie hs do not allow an adequate 
the soci; pe of the unproducüvity of 
ing int a Pa in groups. Before enter- 
Sults of. ee discussion of the re- 
it seems idee studies experiments, 
Point Bi visable to make a general 
group, wout the performance of the 
1 most orking on social studies tasks. 
rectly Cosi the groups solved cor- 
S had er problems than the H-Ab 
at e when working alone. 
even 8 true for 16 of the 24 groups. 
higher groupe earned scores slightly 
ne com an the able member, and for 
ence, т гөп, there was no differ- 
of this e most obvious explanation 
Seem to [pae difference would 
themsely € associated with the tests 
Mental ү es which were used as experi- 
of the oro That is, if Form 3B 
€ ГЕР Social Studies Test (used 
Form s was more difficult than 
the diff, which was used as a pretest, 
terms re would be explicable in 
culty, H differential levels of diffi- 
basis a Eh there seems to be little 
raw m Such an explanation. Equal 
Converted» yield virtually identical 
est useq Scores for both forms of the 
ational Ton these experiments (Edu- 
le item ae Service, 1957). When 
8 ifficulty levels are examined, 


ficult on Form 3B, and 17 on 
Чопа] Т, are more difficult (Educa- 
all or ү Service, 1957). Since 
all the experimental groups tried 
items, it seems unlikely that 
forms T arability between the two 
the dto: the test could account for 
6 Fes differences. 

task vali m do suggest that the 
uires Pe oe in problem solving re- 
Ypie co alysis in terms of more geno- 
ferences УГО. For example, the dif- 

etween the two sets of 


lack of 
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experiments may be attributable to the 
more explicitly taught responses in 
arithmetic as opposed to social studies; 
or to special demands upon reading 
skills required in order to operate 
effectively with the social studies 
tasks. 

The problem-solving model upon 
which this work is based suggests that 
the perceptions of subjects about the 
group tasks may be an important lead 
for future inquiry. That model not 
only assumes that tasks are perfectly 
scaled for difficulty; it presupposes 
that subjects’ perceptions of the task 
as a scale are veridical. If the assump- 
tion of veridicality is untenable, as it 
very likely was in the social studies 
experiments, group productivity 
should be diminished accordingly. 
That is, in tasks where subjects do not 
perceive their limitations, they are less 
likely to be amenable to accepting 
solutions of others, because the faulty 
perception which leads a subject to 
believe he can handle the problem in 
the first place would be expected to 
militate against such recognition. 
Thus, H-Ab S and Av-Ab S might 
often arrive at conflicting solutions. 
e Av-Ab S would have little if 
any opportunity to check the credence 
of his solution (as he has in a task 
like arithmetic) the group’s final solu- 


tion may be the result of a power 
struggle. If Av-Ab S is more ascend- 
ant than H-Ab 8, or if the Av-Ab Ss 


coalesce against H-Ab S, results very 
much like those obtained in these ex- 

periments would be predicted. 
Explicitly, with respect to small 
group problem solving, this analysis 
points to further examination of the 
dynamics of groups engaged in prob- 
lem-solving tasks whose substantive 
components, while requisite for ap- 
propriate solution, are still less clear 
as true of the problems em- 

ed in these experiments. 


Sine 


than W: 
ploy 
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Р 
E EFFICIENCY OF TWO MULTIVARIATE 
TISTICAL TECHNIQUES IN COMPARISON 
WITH CLINICAL PREDICTIONS 
C. E. ALEXAKOS' 
University of Wisconsin 


The college grade-poi i ior hi 
-point average of academically superior high school 
ao was predicted by statistical methods (regression and dis- 
diene analyses) on the basis of 19 independent variables, and by 
a "t counselors on the basis of all information collected through 
abt seling interviews and testing during the 4 high school years. The 
ained results indicated that the statistical were slightly superior to 


Ше clinical predictions; however, both t 
efficient as one would wish them to be. Weighed combinations 
е ой increased the efficiency by 37% 
ў М 106 % in the most favorable. It was conclu 
i nical-statistical predictions could be more e 
n counseling college-oriented students than any 


ypes of prediction were not as 
of the 3 
in the least favorable case and 
ded that combinations of 
fficient and more useful 
single method. 


аи 


ыа 
oe эр of academic success be- 

major igh school level has been 
Psycholo concern in educational and 
Ozen een research. Less than a 
Paring ha ago Meehl (1954), com- 
with Ws оу of the statistical 
Pointed out A clinical predictions, 


So. И 

а Os y em the literature contains 
inica] so carefully executed studies on the 
- Psycholo ата] issue. Although а number 

Bists have ERU psychiatrists, and sociolo- 

th. ence cones the problem, empirical 

we two ict the relative efficacy 0! 
anting [p ag] of prediction is largely 


Sta Jobin: 

8 praticians have developed vari- 
Session Ctive techniques such as re- 
Barithmi, stimates, ratio estimates, 
rame ed estimates, and even non- 

eiric methods of prediction. In 
h and psychological re- 
ud most frequently used 

„тешеп has been the 

gression analysis. By 
me "least squares” method 
B Ou on analysis takes into 

Mong a „216 linear relationship 

Dumber of concomitant var- 


: N ow 
at -—- 
West Virginia University. 
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iables in order to predict a criterion 
variable. 

A multivariate technique which 
was developed almost simultaneously 
by Hotelling (1931), Fisher (1936), 
and Mahalanobis (1936), each work- 
ing independently in different parts 
of the world, is the discriminant 
analysis. The discriminant analysis 


by maximizing the ratio: 
Between groups sum of squares 
Within groups sum of squares 


seeks to maximize the distance be- 
tween or among groups. The dis- 
tance between pairs of groups may 
be colinear or not. This characteristic 


constitutes an advantage of the dis- 
criminant over the regression analy- 
sis, since the latter is restricted to 
linear relationships only. The method 
of discriminant analysis requires that 
measurements on the same variables 
are available for all members of the 
sample on which the discriminant 
functions are to be computed and 
all additional individuals that are to 
be classified into the different groups. 

f this method a given 


By means O 
population or sample could be clas- 
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sified into two or more distinct and 

mutually exclusive anre on the 
j iven measuremi А 

| - p em Rulon, & Bryan, 

1951), while comparing the pn 

inant function with the multiple re- 

gression analysis, observed that: 


i lation techniques are designed 
unie ace enseritlally fitting for the study 
of relationships between measurements: that 
is, between graded classes. And the relation- 
ships so studied are generally holding among 
the members of a single class out of various 
interesting unordered classes. ... The dis- 
criminant analysis does what the multiple 
correlation approach does not. It uses group 
membership as the criterion and makes all 
comparisons between groups and none within 
groups [p. 87]. 
Further, according to Tiedeman 
(Tiedeman et al, 1951) : 


discriminant analysis might also be used as 
a test of whether the regression on a set of 
measurements is really linear or not. If 
classification according to the criterion is 
used to define the several groups, discrimi- 
nant functions providing for maximum dis- 
crimination among these criterion classes in 


terms of the n measurements can be com- 
puted [p. 79]. 


It seems, therefore, that, while re- 
gression analysis cannot deal with 
diseriminant analysis problems, dis- 
criminant analysis can be adapted in 
such a way as to deal with some re- 
gression problems, if the groups are 
defined in terms of the criterion vari- 
able. | 

The present study was designed to 
compare predictions concerning the 
college achievement of academi- 
cally superior students made by re- 
gression equations, discriminant 
functions, and clinician _ counselors. 
Comparisons were made in terms of 
the obtained results and could possi- 
bly serve the purpose of accumulat- 
ing empirical evidence concerning the 
relative predictive efficacy of each 
method. As was mentioned earlier, 
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theoretical comparisons between gd 
gression and discriminant ee 
have been made by Rulon and peer 
man (Tiedeman et al, 1951), and abis 
tween statistical and clinical pred! 
tions by Meehl (1954). бв 
А word of caution is due here. v 
study is not intended to show Ks mes 
regression and discriminant ana pn 
can be applied simultaneously m 
indiscriminately to the solution he 
all problems of prediction. As e 
has pointed out, the two method er 
designed to answer two diffe E 
types of problems. In general, A 
gression analysis produces Lacer 
sults and offers more informa eN 
when the relationship between P- 
dictor variables and criterion 18 ies 
ear. Further, if the relationship sat 
tween predictors and criterion - in^ 
linear, exponential curves of t d to 
dependent variables can be ШЫ рег 
improve and supplement the ien 
regression. However, as the cet } 
the predicting equation is raise 
computational labor increases when 
proportionately, particularly Lien 
the number of predictors is ede 
Thus, the number of terms o © 
for computing a second degre, /2 
parabolic equation is n(n —. ples: 
where n is the number of epe Be 
When n — 20, the number © ine 
quired terms = 190. To deter. 
and evaluate the 190 terms re ^ 
very cumbersome, if not prohib! 
computations. d, 
The discriminant analysis meth? 
on the other hand, is designe ns 
deal with classificatory рг00 6, 
and can be used only when t P esed) 
terion variable is divided into cla nis 
such as low, average, and high. ^ jf 
type of nominal classification, p- 
course, is less refined than ar ре 
terval seale used in regression a nt 
lems. The results of the discri. pe 
analysis, therefore, are likely 


dis- 


Ver 


STATISTICAL VERSUS CLINICAL PREDICTION 


COarser | 
oimne hs general case. Then, 
Ente a analysis can offer evi- 
fhe a the linearity of 
and а between predictors 
Whether à on, and. thus indicate 
necessary Polynomial analysis is 
exemplifies not. The present study 
genera] one specific case of the 
Ш\л Nau. as discussed by 
al, 1951) Tiedeman, (Tiedeman et 
With the р and should not be confused 
general case. 


METHOD 


The 
mal was conducted at the Research 

ts at th e Laboratory for Superior Stu- 
dents i o University of Wisconsin. Stu- 
Program ay participate in the Laboratory 
they are are selected by their teachers when 
Students E He ninth grade. The number of 
he ias participating school sends to 
Schoo] ne A ory is proportional to high 
Per year an ranges from 2 to 5 students 

uidance The selected students visit the 

Су are j waboratory once a year while 
taking te n high school and spend the day 
and inven completing information blanks 
s takin ories, visiting university classes, 
ra pu part in counseling interviews. 
Laboratory о detailed description of the 
9 Rothne operations the reader may refer 
букш, (1961) article. 
Pants Wake samples of Laboratory partici- 
b ns and am that the regression equa- 
M. the first scriminant functions obtaine 

al could be applied to the second. 
female students were treated 
ecause previous analysis of data 
ad shown sig- 
dependent 


е а 
Separately. 
n Lal 
nificant ry, participants h 
i х 1 егепсев іп ѕоте іп 
р f E first-year sample was com- 
‘Males wh 3 students: 98 males and 65 

June 1962. graduated from high school 
pales айа » the second of 125 students: 72 
'gh schoo] d P who graduated from 

he 1901 im 1963. 
Subjects ollowing data were available on all 


; Nu 
Sübject Бег of courses taken in each of five 
s during high school and grade- 


Point 

20 averg 1 

Rrdinary Aono achieved in each field. (The 
Point scale was used to convert 


ades. 
Sa" erii quantitative points.) . 
tay Oeconomi. ating to each student's family 
or ned from il background were either ob- 
estimated he student’s information blank 
by high school staff members. 
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These were: (a) father’s and mother’s level 
of occupation, (b) father’s and mother’s 
amount of formal education, and (c) esti- 
mated family annual income. 

3. Laboratory records contained each stu- 
dent’s scores on the following tests: (a) 
Terman Concept Mastery, (b) Watson- 
Glaser Test of Critical Thinking, (c) Davis 
Reading Test—Level of Comprehension, and 
(d) Davis Reading Test—Speed of Com- 


prehension. 

Concerning the variables of father’s and 
mother’s level of education, each subject 
was assigned a numerical score on a scale 
from 1 to 8, the lowest score being assigned 
to parents who had not completed elemen- 
tary school and the highest to parents who 
held advanced degrees. The variables of 
parental level of occupation were weighted 
on a scale from 1 to 6. High school faculties 
had classified each student's family income 
into one of three broad categories: below 
$5,000, between $5,000-10,000, and above 


$10,000. 
Analysis of the data was performed by 
means of both multiple regression equations 
and discriminant functions for the year 
sample of 1962. The known grade-point aver- 
e 1962 sample at the 


age (CGPA) of th 
end of the second year in college served as 


the criterion variable in computing both the 
equations and the discriminant 


regression 

functions. The grade-point average of the 

1963 sample was not known until after its 
; the regression equations and 


prediction by 
the classification 0: 
the discriminant fu 
variable was internally relati 


dependent variables in the ге п 
tions, but was externally used to establish 


the groups and the а priori proportions of 
students who should belong to each group 
in the discriminant analysis. Thus, the 1963 
sample was used as а cross-validation 
sample in order to check the efficacy of the 
equations and functions obtained from the 


1962 sample. 


f the sample according to 
nctions. The criterion 
ed to the in- 
gression equa- 


RESULTS 


According to Multiple Regression 
Multiple regression equations were 
computed on the basis of 19 inde- 
pendent variables to predict the 
at the end of the 
ar in college. The 19 inde- 
pendent variables are presented in 
Table 4 along with the t-test values 
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TABLE 1 
p 963 STUDENTS CLASSIF r INTO E BAND 
‘AGE OF 1963 STUDE: IED ConnEcTLY INTO Eacu GPA 
ERCENT. 


By regression 


By discriminant analysis 


By counselors 


1 
SEPA Eine Male Female Male Female Male Female 
25 
30 2 
70 25 60 57 2 
erty 90 53 66 41 5 E. 
3.00-3.49 30 55 45 85 E ЕН 
3.50-4.00 00 33 8 33 


computed on the beta weights and 
the critical ratio values of the dis- 
criminant function. Variables 10 
(high school grade-point average in 
science courses) and 12 (mother’s 
level of education) were significant 
predictors of the college grade-point 
average beyond the 5% level for 
male students, Variables 6 (high 
school grade-point average in Eng- 
list courses) and 17 (score on the 
Watson-Glaser Test of Critical 
Thinking) were Significant predic- 
tors beyond the 2% level for females, 
The multiple correlation coeffi- 
cients were .74 and .69 for males and 
females, respectively, According to 
these coefficients it seemed that by 
using the obtained regression equa- 
tions one would be able to predict 
the CGPA of similar samples of stu- 
dents at a rate Significantly better than 
chance. The equations Were actually 
applied to the 1963 sample and the ob- 
tained results are presented in "Tables 


with the classification made sd |= 
discriminant analysis tests an ion 
the counselors the following ee 
classes (or CGPA bands) were se 


Band 1: CGPA less than puma 
Band 2: CGPA between pen 
Band 3: CGPA between ШШ 00. 
Band 4: CGPA between 3.50-4: 


à 18 
According to Discriminant Analys 


:envimi- 
As explained earlier, the perd 

nant analysis test can be pet ple 
classify a given population or пау 
to а number of distinct and n 0 
exclusive groups. The proporti oup 
individuals falling into each Pally 
can be determined either cca 
or a priori on the basis of apes mple 
knowledge. In the present 3 em- 
the proportions were established Sy 
Pirically on the basis of the known: 
of the 1962 sample which was 


ale 
The 1962 sample contained 10 Tab 


Ae and 7 female students who Ta 1); 
1, 2, and 3. The predictions obtained tained CGPA below 2.00 eem had 
by regression were point Predictions, 37 males and 27 females byork 99 
but for the purpose of comparisons achieved CGPA between 2: 
‘ABLE 2 
PERCENTAGE OF 1963 STUDENTS PREDICTED то PERFORM AT A HIGHER 
LEVEL THAN Тивү AcTUALLY Dip 
By regression By discrimant analysis Hoia 
le 
ERDI fani Male Female Male Female Male = 
75 
30 75 40 43 70 59 
к 3 4T 31 35 65 10 
3.00-3.49 0 - - E на 


=A : 


2 
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TABLE 3 


PERCE 
NTAGE OF 1963 STUDENTS PREDICTED TO PERFORM AT A LOWER 
LEVEL THAN THEY AcruaLLY DID 


CGPA band By regression By discriminant analysis By counselors 
Male Female Male Female Male Female 
9 
2.00- 
Hee E 0 3 24 3 0 
3.50-4 00 70 20 45 10 15 35 
s 100 67 92 67 62 75 
(Band 
2): М 
); 40 males and 22 females а linear model than did the male 


wi 
toe between 3.00-3.49 (Band 
GPA b males and 9 females with 
hone Ween 3.50-4.00 (Band 4). 
by the Pam to the CGPA obtained 
of individ 62 sample, the proportions 
"iin uals belonging to each band 
mputed as follows: 


Males 
Ba 5 Females 
ang 1/10/08 = 10 7/85 = Ш 
and 2: 37/98 = .38 — 27/65 = 41 
Bang 5: 40/98 = .41 20/05 = 94 
4: 11/98 = a1 9/65 = -14 


ternal Proportions served as the ex- 
analysis riterion in the discriminant 
ividua] test for computing the in- 
© basi discriminant functions, on 
the q is of which the members of 
ne of + Samples were classified into 
Чопа] d e foregoing bands as addi- 
9f the pit That is, the classification 
Ordin 63 samples was made ac- 
ion. 5, to the discriminant func- 


lons я 
Ples „шей from the 1962 sam- 
“pirical according to the foregoing 


e 2, Ploportions. 
function, btained overall discriminant 
Dificanr Were both statistically sig- 
leve] an the one for males at the .05 
1 leve] the one for females at the 
between ` The generalized distances 
Signifie pairs of CGPA bands were 
ang „Cant between Bands 1 and 3, 1 
between i 2 and 3 for males, and 
er ir and 1 and each of the 
these oun” for females. In view of 
the ап results, it appeared that 
Male g criminant function of the fe- 
Sample more nearly approached 


sample. The male discriminant func- 
tion was rather curvilinear at the 
upper end because Group 4 was not 
significantly different than Group 2 


while Group 3 was. 


According to Counselors’ Predictions 


Counselors had been asked to pre- 
dict each student’s level of perfor- 
mance in college shortly before the 
student's graduation from high 
school. In doing so, they were guided 
by the following classification: 


He or she will: 
A. be eligible for major awards 


based on scholastic performance. 
B. be eligible for minor awards 
based on scholastic performance. 
C. complete training. Will not be 
eligible for awards nor on probation. 
D. take longer than usual time to 
graduate though never on proba- 
tion. 

E. drop out of 
other than scholastic difficulty. 

F. be a marginal student. Will be 
on probation at sone time. 

G. drop because of low grades. 


Although the foregoing descriptive 
categories do not seem to fit distinct 
CGPA classes, their meaning was 
thoroughly explained to predicting 
counselors in seminars and formal 
discussions conducted at the Guid- 
ance Laboratory. Thus, A meant 
that the student will be included in 
the Dean's list (GPA = 3.50 or bet- 


training for reasons 


302 


Constantine E. ALEXAKOS 


TABLE 4 


VARIABLES INVOLVED IN THE REGRESSION AND DISCRIMINANT ANALYSIS TESTS 
AND THEIR CRITICAL VALUES AS PREDICTORS 


Regression Discriminant Analysis 
Variable 

Males Females Males | Females 
Number of English courses in high school —.824 —.791 .361 .572 
Number of mathematics courses in high school —1.374 .930 .523 | 1.100 
Number of foreign language courses in high school .541 .980 .064 | 1.691 
Number of social studies courses in high school — .269 — .266 .408 .201 

Number of science courses in high School .547 1.108 .533 1.311 | 
High school GPA in English 1.590 2.563* 294 | 5.096 
High school GPA in mathematics 1.222 | —1 059 “692 .100 
High school GPA in foreign languages .603 “709 .323 .416 
High school GPA in social studies 1.05 | —.716 | 1.082 | .389 
High school GPA in science 2.085* 1.029 .954 .859 
Father's level of education 469 | —.04 741 | 1.172 
Mother's level of education 22917*| — ‘655 | 3.052* | .139 
Estimated family income ‘891 — 372 i 379 | 1,226 
Father's level of occupation ~ 710 042 `057 | 1.125 
Mother's level of occupation 1397 — "664 “202 ‚218 
Terman Concept Mastery score 1.234 ‘940 | 2.423 2.442 

Watson-Glaser Test of Critical Thinking 1925 2.514* 1514 2.981, 
Davis Reading "Test: Level of Comprehension —1.382 —1.066 “100 5,598 
Davis Reading Test: Speed of Comprehension .056 038 4 1.458 

Note.—In the 


regression tests df = 7; 


In the discriminant analysis tests df = 3 


*p < 05. 
“p< 01. 


bation (GPA = 2.00 or higher), E 
F, and G = potential dropouts on 
probationary students (GPA = be. 
low 2.00). The results of the three 
predictions are presented in Tables 
1, 2, and 3. 

Predieting counselors had available 
4 years of data collected by the 
Laboratory during the Students! an. 
nual visits. These data consisted of 
test scores, high school grades, in. 
formation on students' family back- 
ground, participation in activities 
and leadership roles, educational and 
vocational plans, and information 
collected through the counseling in- 
terviews with both students and 
their parents, as well as autobiog- 
raphies and other samples of stu- 


8, 45 for mal 
» 76 for males; 3, 43 for females. 


" ely: 
e and female students, respectiv" 


z 
dents writing, That is, counselor 
had available all the measurem? ys 
used in the statistical analyses F^, 
Various other types of informi tis- 
that were not included in the St^ 
tical predictions, were 

The statistical predictions "is 
made on the basis of measure ug 
9n 19 variables, which are prese re" 
in Table 4. The 19 variables аге they 
Sented in the order in which 575 
Were analyzed and not according E 
their significance as predictors. of 
in Table 4 the level of significan? jg, 
each predictor variable is er the 
Student t tests were used to Te іл 
Significance of the beta weigh test? 
the regression equations, and 
m the discriminant analyses 
critical ratio values were prin 160 
both cases on the output of the 
digital computer. 


e, 


| 
| 


ý 
( 


Sr. 
ATISTICAL VERSUS CLINICAL PREDICTION 
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Hırs AND Misses I Ios m 
Б s By Касн Prepictive М 
Метнор ann Cut Square Т 
ESTS 


BETWEEN 
SEN PAIRS 
= OF METHODS FOR THE 1963 SAMPLE OF MALE STUDENTS 


CGPA iso Predictive method 
Multi Ipiscrimi 
eec Bee iii соме ls ш 
Hits ES 
Misses : | f Н 200 3.200 1.800 
ч 10 30 Mi р = .325 р = .078 р = .160 
2 Hits 
Misses T 10 11 4.857 7.152 4.419 
i 29 20 EA р < .05 р < .01 р < .05 
its 
Misses x " п .960 2.556 400 
1 9 = .30 = = 
T 20 20 20 á yon i 
18 
lisses 0 3 5 3.391 6.190 2 
y E. E 5 E: 3 722 
р < .10 < .02 =. 
13 13 13 9 ре 


Note.—, 
á df — 1 for all chi-square values. 
Отта; А 
of E fedi rOn the Three Types 
Om H 
fen i risons of the predictive po- 
the Other 2 method with that of 
h numb Wo were madé in terms of 
Classified er of students who were 
Me in correctly (hits) in each 
Misclassified foe to those who were 
Stu ent’s ed (misses) in light of each 
ie the наа. grade-point average 
све, The of the second year in col- 
Y each number of hits and misses 
nificance 1ећой and the tests of sig- 
are Pres between pairs of methods 
пае i den in Tables 5 and 6 for 
"Ev. female students, respec- 
"Tom Р 
vh p able 5 it can be seen that 
{есіру є? 28101 analysis was more ef- 
E than were the counselors (x? 
hens .08) in predicting the 
ctions к CGPA. Regression 
e Obtained | also superior to 
ods in B by the other two 
T} 05; and and 2 Gas? = 4897, P 
le discri Xa? = 7.152, p < .01). 
Iminant analysis was more 


lowes 
thos 


effective than the counselors in Band 
2 (xoi? = 4.419; p < .05). 

Ín Band 3 the counselors pre- 
dicted slightly better than the re- 


gression equation (x? = 2.556; p < 
10), and in Band 4 the counselors 
predicted significantly better than 
the regression equation (x? = 5.190; 

< .02). Also, the predictions by 
the , discriminant analysis were 
slightly better than those of regres- 
sion in Band 4 (? = 3.391; p < 


0). 


Investigation of Table 6 reveals 


that the predictions made by the 
discriminant function were more ef- 
fective than those made by the other 
two methods in Bands 1 and 3. Exact 
tests of significance (Ferguson, 
1959, p. 173) produced the probabili- 
ties of .014 in Band 1. In Band 8 the 
x? values were 3.199 (05 < p< 
.10) between regression and discrimi- 
nant analysis, and 5.729 (p < .02) 
between discriminant analysis and 

In Bands 2 and 4 the ob- 


counselors. 
tained results were 50 similar that no 
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TABLE 6 


v ETWEEN 
Misses By Блсн Prepictive METHOD AND Cut Square Tests BE 
PoE ane or METHODS FoR THE 1963 SAMPLE оғ FEMALE STUDENTS 


Predictive method | 
xat хы? Xas 
CORDS tees Multiple | Discriminant Counselors 
regression analysis 
i 6 1 
1 Hits 1 _ z = ои 
Misses е Д б р = .014 p = .50 p 
.121 
2 Hit 9 7 6 -500 1.929 Muro 
D 7 9 10 p = .50 р = .20 p= -@ 
N 16 16 16 
3 Hits 11 16 9 3.199 422 ae 
Misses 8 3 10 10>p> .05} р = .50 Bs 
N 19 19 19 
4 Hits 4 4 3 a ТРИ s= 
Misses 7 7 8 
N 11 11 11 мер 
Note.—df = 


1 for all chi-square values, 


differences. could exist between the 
three methods. 


Discussion 


predictions depend 
(a) the overall simi- 
the samples on which 
r functions were com- 
puted and the samples to which they 


are to be applied for predictive (or 
cross-validation) purposes, (b) the 
degree to which the empirical data 
fit the linear regression model (good- 
ness of fit), and (c) the amount 
of error involved in the measurement. 
of both predictor and criterion vari- 
ables. Clinical predictions depend, to 
a large extent, upon the global im- 
pression a counselee makes upon his 
counselor. Affected by the overall im- 
pression, the counselor tends to ig- 
nore some bits of information con- 
cerning his counselee and overweigh 
others. Meehl (1954, p. 92) presents 
evidence that “the clinicians sys- 
tematically over-predicted grade aver- 


Statistical 
mainly upon: 
larity between 
the equations о 


lee 
age,” and calls this tendency the le 
niency error,” ietical 
In the present study the un 
predietions appeared to be supe al- 
to those made by counselors, e- 
though they too were far from er 
quate, probably because the data d 
not satisfy all the aforemention 
assumptions, The regression ed tly 
tion, for example, identified correc ет f 
more male students in the Opis 
two bands of CGPA than in the ge 
per two. This result seemed pen 
because, according {о regner ег 
theory, the middle bands, or ra A, 
the students close to the mean C otly 
should have been predicted eri 
More frequently. To obtain pes i 
result, however, would assume ide re- 
eal distributions of the measure; 
ments on all 19 variables between 
1962 and 1963 samples. — not 
Evidently this assumption was re- 
satisfied by the data. Failure to od 
dict the upper bands, while ^ 
fully predicting the lower ones, $ 


` 


b omm „ 


STATISTICAL VERSUS CLINICAL PREDICTION 3 
05 


a ily that the measurements on 
same) variables for the 1963 male 
than ti were, on the average, lower 
this those of the 1962 sample. For 
woul кшш the regression equation 
1963 prediet lower CGPA for the 
the sample, and consequently miss 
РА eee upper-band performers. 
sult her cause contributing to this re- 
of aee be the fact that the data 
їв does example did not fit the linear 
com ssion model as was shown by the 

pee diseriminant function. 
org contrast to regression, counsel- 
сор more successfully the 
Киса of the upper bands of male 
iim but at the same time over- 
man ated the college performance of 
^ S students. This result seems to 
toe on with the studies re- 
Servatio y Meehl and supports his ob- 
error” n concerning the “leniency 
eth the sample of female students, 
ieu осш difference was ob- 
multi „ееп the predictions by 
ИЕЫ] regression and clinician 
Жош; Significant differences, 
sults ey were found between the re- 
é oi the discriminant analysis and 
Seems er two methods. This finding 
forma; to imply that the college per- 
More ex of females can be predicted 
membe | ciently on the basis of group 
Or eli er ship than by linear regression 
the ps judgment. Particularly, in 
Colle wer band female students left 
bere not because of poor academic 
None o but for other reasons. It 
Crimin from the results of the dis- 
dents ant analysis that female stu- 
Sip, DO dropped out of college had 
an x characteristics in both years 
ed aua be singled out and classi- 
Жеш y on the basis of meas- 
Ae ts on the 19 variables. 
result ummary, in spite of the better 
digo? Obtained by statistical pre- 
ms, all three methods of Р! 


TABLE 7 
COMBINATION OF THE THREE METHODS 
TO OBTAIN A COMPOSITE PREDICTION 


Average % 
ciency 


Relative % 
Predictive method эу 


Males |Females| Males |Females 


Clinical 38.5 | 34.0 | 100 

c К : 1 

Multiple regres-| 48.0 | 41.5 | 125 а 
sion 

Discriminant 48.0 | 54.0 | 125 | 159 
analysis 

Composite 62.5 | 70.0 | 162 | 206 


diction leave much to be desired in 
terms of reliability and accuracy. 
Statistical predictions could be та. 
proved by more precise and more 
reliable measurements as well as by 
the inclusion of more predictor vari- 
ables in the matrix of linear equa- 
tions. Counselors’ predictions could 
be enhanced by à consistent effort to 
avoid “leniency errors.” 

From the obtained results it ap- 


peared that combination of the three 
methods could produce more efficient 
ingle method. 


predictions than any Si 
For this purpose, each predictive 
method was weighed according to its 
average efficiency, and the results of 
Table 7 were obtained. It is evident 
from Table 7 that by a combination 
of the statistical-clinical methods a 
gain in efficiency ranging from 37 to 
106% was obtained. This result seems 
to imply that à reconciliation be- 
tween statistical and clinical meth- 
odology; instead of the present ani- 
mosity, might serve better the cause 
of counseling students. i 
Each method has its own merits 
but none i$ flawless. Statistical anal- 
yses can combine the available meas- 
urements in à way that no clinician 
counselor can. Clinician counselors, 
on the other hand, can sense and take 
jnto account some individual charac- 
teristics of their clients that no sta- 
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tistical analysis can. Combinations 
of  clinical-actuarial predictions, 
therefore, could be more reliable and 
more useful in the process of coun- 
seling. . 

In this study the regression analy- 
sis showed that for male students 
the variables of high school GPA in 
science and mother's level of educa- 
tion were the most significant predic- 
tors. The importance of the former 
variable can be explained by the ob- 
served trend among male students to 
major in the fields of science, mathe- 
matics, and engineering in college; 
the latter is more difficult to explain. 
It seems that mothers with a higher 
level of education exert some motiva- 
tional influence on 


to college and succeed. For female 
students the 


Critical 
the most significant 
result can be ex. 
f most female stu- 
dents majoring in the humanities, 
social studies, and kindred fields in 
college, which require high profi- 
ciency in English 

Similar results were obtained by 
the discriminant analysis test, How- 
ever, the variable of high school 
GPA in science was not Statistically 
significant in this test for male sty- 
dents, and the score on the TCM Test 
approached the .05 level of signifi- 
cance but was not large enough to 
be declared significant. The discrimi- 
nant analysis on the female sample 
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followed the same pattern as the w 
gression analysis with the variab В 
of GPA in English and the Dawe 
Reading Test: Level of Sale 
sion being significant at the .01 lev | 
the variable of the Watton Geni 
Test at the .05 level, and the TC е 
Test approaching the .05 level wd 
nificance. All significant diserimi- 
nants for females were verbal aed 
bles. In view of these results, a 
seems logical to believe that aed 
ciency in verbal and language xi е 
із а strong predictor of co t 
achievement for female students, 

not so for male students. 
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COMPARISON OF 

: THE EFFECTS OF POSITIVE AND 

NEGATIVE VICARIOUS REINFORCEMENT IN AN 
OPERANT LEARNING TASK 


RICHARD KELLY 
DePauw University 


In thi: Я i 
n this study 60 male and 60 female kindergarten children were tested 


E EUN pairs with 1 S serving as a model and 
an observer. Ys of the models performed a simple motor task u 


the 2nd S as 
nder 


“ы positive social reinforcement (praise), Ys under regular nega- 
ro reinforcement (criticism), and Y% received no reinforce- 
e) сз ег femp v models perform, the observers worked on the 
dir р Д irect reinforcement. The results indicate that in both the 
Ri vicarious reinforcement groups the highest operant rates 

maintained under the negative reinforcement and the lowest rates 


H m ; Wu. 
in the "no reinforcement" condition. 


j ME of investigations have 
lis E аа. with a comparison of 
Eisen effects of reward and 
еве ке on children's learning. 
articles Tu have been reviewed in 
(1964) y Kennedy and Willcutt 
jud and Marshall (1965). 
studies d have been several recent 
Vicarioy ealing with observational or 
Chrest, ТО learning (Barnwell & Se- 
fer, 1965 65; Berger, 1961, 1962; Kan- 
Osenbar’ Miller, Zavos, Vlandis & 
wo Daum, 1961; Sechrest, 1963). 
bog, of these studies have included 
ions, tie een and punishment condi- 
direct See (1963) suggested that 
оу ait positive or negative re- 
an Кы to a performer represents 
ап obse ‘ite implicit reinforcement to 
Completing who is simultaneously 
Subject ng a similar task. Thus the 
Teinforg (S) observing a positively- 
Isheq a model is implicitly pun- 
торо, lle S observing а negatively- 
model is implicitly 
hereto: - On a subsequent task, 
Shoulq €, the responses of the observer 
former not parallel those of the per- 
mie has been directly rein- 
f Serverg е findings indicated that 
cement responses following rem- 
Site h to a performer were oppo- 
Ose of the performers’ Te- 
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sponses. There were no significant 
differences between the groups that 
received explicit and implicit rein- 
forcement but a reliable difference 
between the positively- and nega- 
tively-reinforced groups. In a second 
study (Barnwell & Sechrest, 1965) in 
which implicit reinforcement effects 
were ruled out by having the observer 
rm only after the reinforcement 
been given to the model, first- 
onded to vicarious posi- 
tive reinforcements in a 
way which paralleled the performer’s 
responses to direct reinforcement, 
while third-grade Ss failed to respond 
differentially to the two types of rein- 
forcements. Since the criterion was 
whether the observer chose to work on 
the same or a different task than the 
performer, no comparison of the rela- 
tive effects of reward and punishment 


was possible. 

The purpose of the present study 
was to compare the relative effects of 
positive and negative vicarious теш- 
forcements in an operant learning task 
which precluded implicit reinforce- 
ment effects. It was hypothesized that 
the effects of vicarious reinforcement 
would parallel those of direct rein- 

an earlier operant learn- 


forcement. In 
ing study, Kelly and Stephens (1964) 


perfo 
had 

grade Ss resp 
tive and nega 
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found that negative reinforcements 
given directly had a greater effect on 
response rate than positive reinforce- 
ments. A second hypothesis was, there- 
fore, that the effects of vicarious pun- 
ishment would be greater than reward. 


METHOD 


Subjects 


The Ss were 60 male and 60 female 
kindergarten children enrolled in four pub- 
lic elementary schools in Greencastle, Indi- 
ana. They ranged in age from 5 years, 5 
months to 6 years, 9 months and were 


predominately from middle and upper-lower 
class families, 


Apparatus 


The apparatus, which was adopted from 
Stevenson and Fahel (1961), has been de- 


scribed in detail elsewhere (Kelly & Steph- 
ens, 1964), It con: ly р 


31 X 30 X 4 inc 
table 28 inches hi 
box was a bin 1354 
taining 750 marbles, 


inches high across the midi ч 
shielded E's recording int Be bor 


materials b 
E's head and shoulders visible o 8 o 
Procedure 


The children were tested in Pairs with 
each S being randomly assigned a partner 
of the same sex. One member of the pair 
was designated as the performer (P) and 
the other as the observer (О). The P was 
instructed to stand in front of the appara- 
tus while O stood to his left so that the 
responses of P could be easily observed, 
The Ss were then given the following in- 
structions: 

Now we are going to play a marble 
game, (name of P inserted) is 
going to play first and then it will be 
— — —'s (name of О inserted) turn. Let 
me show you how to play. Take a marble 
from here and drop it in a hole (E dem- 
onstrated by taking a marble from the 


bin and dropping it in one of the holes in 
the tray). You can use any of the holes 
you want but be sure to take one marble 
at a time and use only one hand. Now 
you try (P was given an opportunity to 
make a response). We won't use all the 
marbles that are here in the game. I will 
tell you when to stop and then it will be 
’s (name of O inserted) turn. 
There were three learning conditions: 
(a) regular positive social reinforcement 
(praise), (b) regular negative social rein- 
forcement (reproof), and (c) no reinforce- 
ment. The learning task, which required 6 
minutes, was divided into 1-minute intervals. 
In the positive and negative reinforcement 
conditions, no reinforcements were given 
during the first 1-minute interval, Beginning 
with the second minute the first response 
after the first and thirtieth second of each 
minute was reinforced, giving a total of ten 
reinforcements during the learning task. In 
the positive reinforcement condition Ё made 
supportive or encouraging statements about 
P's performance, while the reinforcements 
in the negative condition consisted of criti- 
cal comments. Since the purpose of the 
study was a comparison of the effects of 
positive and negative reinforcement, the 
reinforcements were worded so that both 
reward and punishment would produce the 
Same response tendency—increase in rate 0 
response. This was accomplished by includ- 
ing in the reinforcing comments the impli- 
cation that speed of response was important. 
For example, the positive reinforcements 
included statements such as “Good—that's 
fast,” while the negative reinforcements 1D^ 
cluded such comments as “That’s slow— 
not too good" or “That’s not very fast." 
reference to speed was included in the first 
reinforcement in each 1-minute interval. 
the control group P performed the task for 
minutes without receiving any reinforce- 
ments, 
. When P had completed the task, Ss were 
instructed to change places and O was al- 
lowed to play the game for 6 minutes. During 
the time О performed the task no rein- 
forcements were given by E. А 
At the end of the experimental sessio 


both Ss were commended for their perform- 
ance, 


RESULTS 
The number of responses (marbles 
dropped through the holes by S) Ре? 
“munute interval was used as the cri- 
terion measure. The data for the 


Errect or Vicarious REINFORCEMENT 


&roups receiving the direct and vicari- 
"à reinforcement were analyzed in 
z separate 2 X 3 X 6 repeated 
easures analyses of vari 1 
1962. y variance (Winer, 


Nur analysis of variance of the data 
{ 3 the groups receiving direct rein- 
orcement showed a highly significant 
elnforcement main effect (F = 11.42, 
yg TN р < .01). The curves shown 
fir 1 indicate that the group 
inm negative reinforcement 
fede ec the highest overall response 
бе ed the first minute, while the 
isis "te group responded at the lowest 
Were he Sex and Trials main effects 
an not significant, nor were there 
Y Significant interactions (p > .05). 
fs: the analysis of variance of the data 
е М е vicarious reinforcement groups 
wr o that the Reinforcement main 
713 Pd highly significant (F — 
indi; f = 2/54, p < 01). Figure 2 
iene that, as was the case in the 
i reinforcement condition, the 
oa response rate occurred in the 
жы Which received negative rein- 
a m ub followed by the positive 
ont Orcement group, and then the 
Tol group. There were no other 


“ 


26} 


e 


MEAN NUMBER OF RESPONSES 


$ DIRECT POSITIVE REINFORCEMENT 
teh x DIRECT NEGATIVE REINFORCEMENT 
NO REINFORCEMENT 


5 


ig Leer Me 
: 4 6 


2 3 
T TRIALS 


Iq, 
Rou 5 Ъ Mean number of responses of 
eement IDE three types of direct rein- 
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20 

@ VICARIOUS POSITIVE REINFORCEMENT 
© VICARIOUS NEGATIVE REINFORCEMENT 
4 NO REINFORCEMENT 


MEAN NUMBER OF RESPONSES 
S 


' 2 4 5 6 


3 
TRIALS 

Fic. 2. Mean number of responses of 
groups receiving three types of vicarious 
reinforcement. 


significant main effects or interactions 
(p > .05). 


Discussion 


The performance of the groups re- 
ceiving direct reinforcement in the 
present study provides a replication 
of the earlier Kelly and Stephens 
(1964) findings to the extent that 
there was a greater response to nega- 
tive reinforcement than to positive 
reinforcement. 

The responses of the vicarious rein- 
forcement groups support the experi- 
mental hypotheses. The responses of 
the Os in both the positive and nega- 
tive reinforcement conditions paral- 
leled those of the Ps in that both 
groups showed a higher response rate 
than did the control group. Also, as 
predicted, the response of the Os in the 
negative reinforcement condition was 
greater than that of the Os in the posi- 
tive reinforcement condition. The 
parallel response of the Os is consist- 
ent with the Barnwell and Sechrest 
(1965) findings with first-grade chil- 
dren and lends some further support 
to the hypothesis that implicit rein- 
forcement effects occur only when O 
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is performing a task concurrently with 
E possible interpretation of Ше 
present results is in terms of an ex- 
pectancy hypothesis. The Os, by view- 
ing the model perform the experi- 
mental task, obtained information as 
to the correct or desired response and 
also observed the type of reinforce- 
ments (ie., positive or negative). It 
may be that as a result of this the Os 
developed a set of expectancies for 
either reward or punishment in the 
situation, and the effect of these ex- 
pectancies was similar to that of the 
direct reinforcement, That is, expec- 
taney for punishment had a greater 


effect on performance than expectancy 
for reward. 


ceiving orcement. 
Furthermore, since O performed im- 


mediately after P, it would appear 
reasonable to assume that an increase 
in drive level as the result of identifi- 
cation with the model would carry 
over into the test situation, The higher 
drive in the vicarious negative rein- 
forcement group would then explain 
the greater response rate of this group, 

Several studies have Suggested that 
punishment administered to the child 
can have detrimental Side effects 
(Bandura & Walters, 1959; Sears, 
Maecoby, & Levin, 1957). Since the 
present findings indicate the effects 
of vicarious punishment parallel those 
of punishment administered directly, 
the question is raised as to whether 
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detrimental side effects are also pro- 
duced in the child who observes 
another being punished. 
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AGE AND MENTAL ABILITIES: 
A SECOND ADULT FOLLOW-UP 
W. A. OWENS! 
Purdue University 


The 96 Ss, first tested in 1919 and again in 1950 with the Army Alpha, 
were again tested with the same test in 1961 at an average age of 61 
yrs. This paper reports the 1950-1961 results. The decade from age 


50 through age 60 was one of relativ 


e constancy in mental ability test 


Performance for the present Ss. The apparent trend, which is statisti- 
Kis ly insignificant, is slightly downward. Contrary to expectation, aging 

oes not seem to have increased either individunl or trait differences 
among the testees of 1961. Clearly, however, major components of the 


pattern of living of these Ss have ser 
Ship between age and mental ability 
ee 2m the writer published a re- 
changes a longitudinal study of age 
jects (S 1n mental abilities. The sub- 
1919 ra. were 127 male freshmen of 
Alpha y had originally taken Army 
nation at 1 6, as an entrance exami- 
"eon UA State University. They 
i ae ested during 1949-50 with 
id ы examination ассот- 
cet „7 9 Personal Information 
ndeg» composed of twenty “open- 
tate à items. Results obtained indi- 
tremo, Pbstantial and significant in- 
a in 1п test scores over the years, 
er] 5 Subsequently confirmed in 
9ngitudinal studies by Bayley 
Nispet Bentz (1953), Charles (1953), 
al (1957), and others. These re- 
forene? Suggest, that individual dif- 
tomain, and trait differences each 
nitude. ot substantially constant mag- 
amo, 20d that such variables as 
S of education, field of college 
| ation, rural to urban migra- 
опар mbers of hobbies and recrea- 
Ae s tivities, and earned income 
hir, ;Portant correlates of temporal 
тр 11 test score, 


EN d 
Tul Pal study covered the Ss’ 


Wi 
а 16 
ud Jon the collaboration of Don C. Charles 
Ro harq S & Bath and the assistance of 
богы" Gillmer, Paul R. Kelley, Michael 
апа Lyle Schoenfeldt. 


Deci, 
tion Siz 
> nu 
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ved as moderators of the relation- 


mean ages 19 to 50. The present 
follow-up extends this coverage ap- 
proximately another decade to mean 
age 61. It was undertaken for at least 
two reasons. First, because the writer 
is unaware of comparable longitudi- 
nal, descriptive data; and, second, 
because it was believed that an at- 
tempt to identify the life-experience 
or life-history correlates of the antici- 
pated shifts in test score would move 
beyond the simple and usual descrip- 
tion of intellective changes towards an 
explanation of them. 

Specifically, the problems of this 
investigation are three in number: 
(a) to describe the impact of an addi- 
tional 10-year age increment upon the 
mental abilities measured by Army 
Alpha; (b) to appraise the effects of 
the given age increment upon the 
magnitudes of individual differences 
and trait differences, as measured; 
and (c) to reveal, via a scored life 
history, characteristics or patterns of 
living which are typical of those 
whose mental abilities age has treated 
more versus less kindly. In all cases, 
emphasis will be upon the last 10- 
year period, but an attempt will also 
be made to integrate the data of the 
last 10 years with that of the preced- 


ing 30. 


312 


METHOD 


In essence, these problems will be at- 
tacked in order by: (a) noting shifts, over 
time, in the levels of the mean verbal (V), 
numerical (N), and reasoning (R) com- 
ponents of Alpha or of its subtests, with 
some tentative correction for accompany- 
ing cultural changes; (b) comparing ear- 
lier and later variances, across tests and 
across people; and (c) item-analyzing an 
extensive scored life history against criteria 
of changes in the Alpha component scores 
from 1919 to 1950 and from 1950 to 1961, 
with subsequent factoring of the “valid” 
items. 


Hypotheses 


Based upon prior knowledge obtained 
from the first follow-w 


experience with life-history data and fac- 


In component Alpha 
у the work of Kall- 
959) on senescent twins 


с conditioning of a de. 
cline in many critical ch 


Suggest that adult 
the quantity 


(1961, p. 363). If this 


1 and the present 
data permit an adequate test, then the 
preceding hypothesis Should be Sustained, 


The second hypothesis is that the two 
point estimates of ability Previously ob. 
tained (1919 and 1950) lie along a typical 
parabolic growth curve now extended para]. 
lel to the base line; thus, that there should 
be no significant change in overall level of 
score from 1950 to 1961. This position is in 
Contrast to the interpretation of Jones 
(1959), who believed that the present Ss had 
Probably improved their potential scores 
so greatly during the college years that a 
subsequent decline was still more than com- 
pensated. If Jones’ view were correct, the 
decline begun at the Ss’ mean age 23 should 
continue and accelerate through their mean 
ages 50 to 60. Conversely, if no decline ig 


apparent during this last decade, it seems 


W. A. Owens 


highly unlikely that one would have been 
detected some 38 years earlier. " 
The third hypothesis is that, based on ар 
assumption of the onset of some diferentie 
organic changes, slight but satistically ae Я 
icant increases in the magnitudes. of bo А 
individual differences and trait difference 
from 1950 to 1961 should be revealed. _ 
The fourth hypothesis, assuming & m 
tinuous nature-nurture interaction, is nn 
socioeconomic status should be a correlate 
of relative improvement in test, performance i 
The fifth hypothesis is that the long. 
Tange importance of motivational eT 
is so great that a number of what Wechs 4 
(1950) might call “nonintellective deter- 
miners of intelligence” should be found to 
related to a relative improvement in te 
performance. : 
The sixth hypothesis is that by pem 
of some recent organic changes in 80! bo 
Ss, failures in health and vigor should d 
found to be related to a relative decremen 
in test performance from 1950 to 1961. . 
The seventh hypothesis is that the i 
cial introvert, less accustomed to socis 
interaction, would be less disrupted ү 
Some essential age-related disengagemen y 
thus, that introversion should be positive 


: a 
related to test, performance over the interv 
1950-1961. 


Subjects 


The Ss were 96 males, of mean age ү at 
1961, who had originally been teste en 
Iowa State University as entering {гөште 
in 1919. The typical S may be character! 5 

s follows: (a) He lived in a rural area ce 
small village prior to college but has Pom 
moved to the city; (b) he graduated fr ne 
college with a major in agriculture oF а 
gineering; (c) he is married, owns his а 

ome or apartment and has had one or айе 
children; (d) he still resides in the pne 

est; (е) his occupation is found towar nce 
upper end of the occupational-intelligo 
lerarchy; and (f) his reported net AE 
for 1960 was between $7,000 and $150 ila- 

The 96 Ss on whom data are here ay fr 

le represent a 75% sample of the 127 ere 
retested during 1950, Of the 31 lost, 13 ts 

eceased, 5 were disabled, 5 could no 

Ocated, and 8 refused to participate. ^. 
further note regarding bias appears 18 


M easuring Instruments 


En in 
Two measuring devices were utilized р 
this study. The first is the Army roxi- 
“xamination, Form 6. It requires PF able 
mately 25 minutes of testing time an 
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T TABLE 1 
ST AND COMPONENT RELIABILITIES 
OF Army ALPHA, Form 6 


Retest reliability 
Subtest 
component 1919 | 1950 | 1919 | Component 
ver- ver- ver- predicted 
sus sus sus 
1950 | 1961 | 1961 
Е folowing diree- 
.30 | .46 | .41 | Reasoni 
2 An i 
парео - 
m, 
CRM s -69 | .83 | .68 | Numerical 
,, ment ^l judg- m 
i Jnonym. .56 | .68 | .60 | Verbal (V) 
‚ Тапу! 
; Disarrango d «64 | .71 | -56 | Verbal (V) 
| Nees 
Number s +48 | .69 | .52 | Verbal (V) 
1, Sompletigg ^? 
* Analogies © .02 | .61 | .54 | Verbal (V) 
в ТЕБУ «50 | .83 | .54 Raining 
Ian 
: Verbal Coe. | 78 | «75 | -59 verbal v) 
Numo; " 75 7 7 
й aa |. 2 = 
u ponent cal com- Б 
У роп com. | 7L | -88 | -73 — 
* Total E .85 | .58 = 
179 | 192 | .78 = 
1 
COntai 
in : " T 
compono S data relative to its reliability, 


0: 
these erii and the subtest predictors of 
А "he ss Ponents, 

Vas as cond measuring device employed 
(LRI) called “Life Experience Inventory’ 
wi Poses, тресацу construeted for project 

Шер Contains 228 objective items, 0: 
foe Btinuum Pear to have their options on 
пец 2elusion and 113 do not. Those chosen 
they PS 1,000 were selected from a file of 
there арраге items on the basis that (a) 
(by re but content was such as to imply 
agni Mision: to intellectual change, an 
{аш а ously tried, they had validated 

tor, Seemingly relevant criterion OT 


D 
ata Collection 


Dar! the Droi 
Vets, 2g lect. Of the 96 ultimate partici- 


Woy” O pe 
West E their time was used for test and 


lei 
| rendered, 


› and tha 
their examiners received 
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per S and travel expenses for their services. 
Most examiners were members of the Ameri- 
can Psychological Association, all were com- 
petent to administer group tests, and each 
was provided with detailed instructions for 
the administration of Alpha and the Life 
Experience Inventory. 


Component Scores 


In the interest of parsimony, intelligi- 
bility, and the enhancement of generaliza- 
tion, shifts in Alpha scores are reported in 
component score form wherever appropri- 
ate. As a consequence, life-history items 
potentially diagnostic of shifts in intellec- 
tive abilities were selected for their demon- 
strated relationship to a temporal shift in 
one or more of these component scores. 

As previously described (Owens, 1953), 
all raw scores on each of the eight subtests 
of Alpha were referred, prior to any compu- 
tation, to appropriate norm distributions 
for 1,000 cases and were “read out” as nor- 
malized standard scores. To combine these 
eight transformed subtest scores into com- 
ponent scores, Guilford’s (1954) report of 
his factorial analysis of Alpha was utilized. 
In it he employs multiple regression to 
weight three independent sets of subtests 
in such fashion as to optimize their rela- 
tionship with the factor on which they orig- 
inally loaded most highly. Since Guilford's 
weights were in raw score form, they were 
multiplied by the standard deviation of the 


norm group in each case to convert them 
ights. The minimum 


(R), component score: 
the given subtest scores 
three administrations of Alpha (1919, 1950, 

and 1961). . 
No criteria, external to Alpha, were avail- 
able in terms of which to document the 
ent intellective changes 


reality of the appar 
implied by shifts in test scores. However, 


в in the component Alpha scores them- 
selves provided the criteria against which 
the items of the Life Experience Inventory 
were validated. Thus, a hypothetical change 
could occur in the V, N, or R component 

it could also occur 


or in the total score; › 
over either the first time period (1919-1950) 


or the second (1950-1961). 

An approach of this sort implied at least 
two questions. The first is—are several 
criteria, all derived from scores on the same 


- Total, 1961-1950 
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TABLE 2 
CRITERION INTERCORRELATIONS FoR Two Time PERIODS 
7 8 
Component | 1 2 2 3 4 | 5 6 

7 ft | 28 
1. Verbal, 1950-1919% | —.16, .19 | 76 Es DB "m "M ort 
2. Numerical, 1950-1919 18 | 17 EY хел, 7 = = 
3. Reasoning, 1950-1919 71 “18 =-10 = "-— 
4. Total, 1950-1919 | | ‘ol | = 12 38 = 
5. Verbal, 1961-1950 | | | | | m 
6. Numerical, 1961-1950 1 | ү ae 
7. Reasoning, 1961-1950 | 
8 | | 


^ The “Verbal, 1950-1919” 
score, across all Ss. 


in Table 


noted that considerable indepe 
indeed, the rule 


rected for |] 
the Spearman 


s the most direct 
desired to meas- 
ure—changes in test score over time, 

subtest content 
nd the 51 quoted 
are likely to be underestimates of the in- 


е, the median 
intercorrelation of Subtests 3, 4, 5, and 8 


the coefficients 


(3) The test-retest intervals involved are 
very large, particularly the first one; so that 
à great deal of true change in Ss, differential 
by subtest, is undoubtedly assigned to error 
variance or unreliability. 


(4) Only group differences are of pres- 


1 


A : = d 
criterion is the 1950 Standard score minus the 1919 standar 


- А із із 
?nt concern; no individual diagnosis 
involved. > е 

(5) Unreliability in the difference ira 
as criteria for the analysis of lifo- than 
items, should cause conservative rather 
radical errors, 


Statistical Analysis 


s 
All computations involving the test au 
Were performed on the normalized stan the 
Scores for the subtests and total or 9B The 
Component scores derived from them. able 
Dorm distributions of 1,000 compe 
cases were arbitrarily assigned a mean are, 
and a sigma of 1. Shifts in mean к 
thus, reported in units of the norm stan was 
deviation. Significance of these ыш tesi 
evaluated through application of the ores 
to a given distribution of difference sc e- 
to determine whether or not its едн tai 
parted Significantly from that of a di: 
ution with mean Zero. i 
The significance of the affect of the ave 
age increment on the magnitude of “was 
vidual differences in a given function 1961 
evaluated by Obtaining a 1950 and а ance 
variance, and by testing for the signis. 
of the difference between these corre ibe 
Variances in the fashion jointly deser 
by Pittman (1939) and Morgan (1939). the 
The comparable effect of age on ces 
Magnitude of trait ог “profile” differens, 
Was determined by computing two ach 
Scores for each S, one for each testing, test 
equal to the variance of his eight su 
Standard scores about their own me hese 
distribution of the differences berpa ani 
90 Series of V scores was then obtaine gua 
its mean tested for significance in the U 
way. 


An 


У tes 
analysis of the life-history correla 
of th 


" wa 
shifts in Alpha scores over time 
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ma " 
the oe relating response on each item of 
criterion " successively, to each of the eight 
1919-1959 c» is (V, N, R, or T for either 
: one 1950-1961). In the case of the 
regarded ntinuum items each option was 
checked - E: a dichotomy, checked or un- 
© otomized пе given criterion was also di- 
significance’ and a direct binomial test of 
ever, no T was applied (Siegel, 1956). How- 
evoted ne attention will be herein 
he numb, the noncontinuum items, since 
could кы ш " them which discriminated 
Rice er nave been accounted for as 
n the event, 
Metri 159 of the continuum items, prod- 
it AEn correlations were run between 
те and each of the eight criterion 


Scores, М 
With ers items which did not correlate 


leve] op criteria above the 20% probability 
lev, vith one criterion above the 10% 


e А 
(105 Vere eliminated. Brozek and Ticde's 
the probers was then applied to determine 
be ae of obtaining the given num- 

сапібсапь items by chance. 

Phase of «imitation of the item analysis 
vie Ss md study is that the number of 
шау ved (N = 96) is so small as to 
tion, [o Preclude meaningful cross-valida- 
t Pract D? since the number is fixed, 
9 оба cal issue seems to be one of how 
E minim a maximum of information, and 
dataa DUM of misinformation, from the 

The fiie 
qe шо Уйа precautions were taken in 
mance in St of minimizing the role of 
Уро око the obtained results. (а) Prior 
‘ticipated regarding the outcomes to be 
inter eed Were set forth, and positive 
aden, Eg are restricted to the coin- 
En results with expectation. (b) 
bli) Nd sj interpretations are tenuous at 
S ed the gi nce the hypotheses offered im- 
ег ‘act Ccessity, the discriminating items 
pus mud n to obtain a more parsimo- 
ozek and generalized interpretation. (c) 
it, sure " Tiede’s (1952) test was applied 
so s TREN the number of discriminating 
it пару, b ified was larger than could rea- 
Was felt attributed to chance. Given this, 
ot that the interpretation of factors 
it, ап ek greatly affected by the presence 
m.( y pa “chance-discriminator” 
i » 6 noted in the preceding para- 
ч wera dards set for initial fino web 

The ne atively high. 

the 9 oe method was applied to a 
ет the elements of which were 
ie Lg prating continuum type items 
Nee боа - plus the eight criterion dif- 
“Ores of Alpha. Communalities 
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were estimated by the image-covariance 
method, and 20 faetors were extracted ac- 
cording to the principal axis method (Har- 
man, 1960). Latent roots for these 20 factors 
were plotted for inspection and both five 
and nine factors were rotated to oblique 
simple structure employing the biquartimin 
criterion (Carroll, 1957). A panel of eight 
judges was then utilized to aid in interpreta- 
tion. Three judges saw only Rotation A, 
three received only Rotation B, and the 
remaining two judges saw both. Decisions 
regarding interpretability and naming were 
based on a consensus of the judges. 


RESULTS 


Level of Abilities 

The first major finding of this in- 
vestigation is that the Alpha scores 
of these 96 Ss remained remarkably 
constant over the period from 1950 to 


TABLE 3 
TEMPORAL SHIFTS IN SUBTEST, COMPONENT, 
AND ToTAL ALPHA SCORES 


Subtest or component p i 

1. Following directions | .001 | .012 
2. Arithmetical problems | — 237 |—.436 
3. Practical judgment —.118 |—.192 
4. Synonym-antonym —.049 |— .072 
5. Disarranged sentences | — 036 |—.052 
6. Number series com- 

pletion —.084 |—.116 
7. Analogies .013 025 
S. Information .043 | .089 

Verbal component —.032 | —.90 

Numerical component | — .203 |—4.38** 

Reasoning component .015 .28 
Total —.047 |—.138 

Note.—M = 5; SD = 1 

** = p < 0l. 


1961.2 Table 3 contains the mean dif- 
ferences and associated t values. It 
will be noted that at the subtest 
level none of the differences is signifi- 
a 

2 Means of those tested in individual ver- 
sus group situations did not differ signifi- 
cantly, although the former appeared slightly 
but consistently higher in 1961. 


MEAN STANDARD SCORE 
wu 
ч 


YEAR 
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—-— Verbal 


Numerical 


—— Reasoning 


© 1961 


Fic. 1. Test performance trends for Components and total. 


cant; indeed, of the apparent, “ 
five are negative and three p 


mponents, the 
largest subtest 
d to produce a 
shift in the N 
tal score, how- 


largest and third 


years ago should be 
the last decade than these declines 
are. The definite implication is that 
any declines which 
done so more recently. A possible ex- 
ception is the case 
nent. Here, 
performance 
at some point within perhaps 10 years 
of college graduation. The speculation 
is lent credibility by the fact that the 


Sample is heavily loaded with engi- 
neers who had no doubt reached some 
Peak of quantitative sophistication 
at college graduation; and by mi 
knowledge that many of these e 
would shortly have moved into ad- 
ministration, into less technical < 
Pects of engineering, or out of the Ex 
entirely. Any such move would омии 
lessen their need for, and ability 
Maintain, quantitative knowledges an 
skills, 8 
The second hypothesis stated Me 
that there Would be no significan 
cange in overall level of score from 


1950 to 1961. This hypothesis is sus- 
tained. 


Individual and Trait Differences 


Data pertaining to the third hy 
pothesis stated appear in Tables 4 y 
- In the former, a comparison of Is 
Variances for 1950 and 1961 uan y 

at four differences are appa at 
negative and four positive and - is 
None is statistically significant. l nt 
therefore implied that no significan 
Shift in the magnitude of individu. 
differences took place over the 11-Y 
Period in uestion. i- 

Table 5 contains comparable E^ А 
dence regarding the effects of 
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" TABLE 4 
EMPORAL CHANGES IN INDIVIDUAL 
DIFFERENCES, 1950-1961 


Subtest | 19506? | 1961 2| =° 

1. Followi " 
2. Arit Ing directions 6 742 
3, Pathmetical problema os 1 288 | Li 
Y вуса judgment 1578 609 1347 
5. Dion m-untonyim 1083 | 1900 | 1:908 
6. Numpatged sentences .656 844 1.604 
т Analogies 8 completion | .701 -850 —.404 
Ton LIE AES. 

‘cores "700 ay iiss 


None of these values is significant, 
i В 
эш increment upon trait dif- 
Proached Significance is not ap- 
Contains " and, although this sample 
Shifts Bs larger number of positive 
ceeds t] е negative sum slightly ex- 
this Bid positive. In accordance with 
Dating ғ епсе, Hypothesis 3, antici- 
К tr D increase in both individual 
leeteq ait differences, must be re- 


T TABLE 5 
bum CnuaNaGEs IN TRAIT 
— DIPFERENCES, 1950-1961 


> 


— 
Sub 
=н oy e) Range | NÆ pg 
lth 
Tou 
Eh 8| —3.2978|.L.1.856 |+52|— -602 
—2.8870|—44 


reat peer scores in 1961. Range = 
nd ‘vidual increase and decrease in 
азе €viation units. N+ = number of 


ow І 
"Value 7198 an increase ог decrease. 
18 not significant. 


1 
x E I 
* Sa Correlates 


i 

Problem respect to the third major 
v Stated, 82 of the 115 con- 
ed y ype life-history items uti- 
Shin. Vere found to be related to 
Assumpt test score over time. If the 
ite perg is made that the criteria 
ho Теа усен correlated, so that there 
a and m one criterion, then Bro- 
at lede's test, (1952) indicates 
* Probability of obtaining 82 
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significant items from a pool of 115 
by pure chance is far less than .0001. 
However, the criterion intercorrela- 
tions of Table 2 show that the “one 
criterion" assumption is unlikely. 
Thus, a more conservative and realis- 
tie probability estimate may be ob- 
tained by assuming complete inde- 
pendence of the eight criteria. If this 
is done, then the total number of pos- 
sible item-eriterion correlations is 920, 
to be compared with an obtained num- 
ber of significant item-criterion cor- 
relations of 214. Given these figures, 
and the 10% probability level as the 
mean item discrimination level, this 
outcome should occur by chance far 
less often than one time in ten thou- 
sand. It thus seems safe to conclude 
that the first, general hypothesis of 
this investigation is tenable; that is, 
there are significant numbers of life- 
history correlates of shifts in Alpha 


score over time. 


Factor Analysis 

Since no analysis of the noncon- 
tinuum items of the L.E.I. was indi- 
cated, it seems important to note that 
the 82 continuum items gave very 
good coverage of the item categories 
originally established. As previously 
noted, these 82 life-history items, plus 
the eight criteria, constituted the en- 
tries in a 90 x 90 correlation matrix 
to be factored. Concurrent with the 
extraction of 20 factors from this 
matrix, Figure 2 contains a plot of the 
latent roots on which the apparent 
inflection point has been indicated. 
A decision to rotate both five and 

ine factors followed from an exami- 
nation of this plot. Of the two, the 
five-factor rotation proved much 
more interpretable to the judges. The 
three who attempted to interpret the 
nine-factor rotation showed little con- 
sensus, particularly for the last four 
factors, and complained of ambiguity 
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Fic. 2. Latent roots for 20 


and overlap among them. 
other hand, the 


only the five-factor rotation, 
two who saw 


tors to five, this evidence led to the 
Adoption of the five-factor rotation 
Although it was 


» nominally, an oblique 
iquartimin cri- 


-11 to — 99 With a 


median midway between 04 and 


—.05. 
TABLE 6 
NUMBER or VARIABLES LOADING on Given 
Factors 
E P = 
Factor кей loading 
Ï 17 2 
TI 13 4 
III 14 | 1 
IV 15 | 1 
у 11 | 3 


factors prior to rotation. 


The number of items with mA 
of 30 or greater on a given fac de 
and the number of criteria with a 
ings of .20 or greater, appear in Leer 
6. It should particularly be o 
that although four criteria one 
above 20 on Factor II and only fac- 
on Factors III and IV, still, bye the 
tor has at least onc loading o 
given magnitude. f five 

In what follows a consensus 0 rial 
judges who reviewed these aa tet 
results has been employed in the =: «е 
Pretation and naming of the — 

he sense of each item has jste 
briefly indicated, and the items n 
in the order in Which they load entage 
factors. The approximate percen it 
of the common factor variance also 
counted for by each factor has 
been поёед з 5 

Factor І, Socioeconomic gt 
was the name assigned to the te 
and largest, factor, which nage 
for approximately 12% of ШЕ © re- 
mon factor variance. The item 


наг wing 

Sponse pattern implies the follo 
F jance 
* Percentages of common factor varian", 


roxi- 
accounted for should be regarded as арр 


„mally, 
mate, since the solution was not, fort 
orthogonal. 
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characterization of high scorers on 
e factor. 


Characterization of Ss Loading 
y Date high occupationally 077 
: Hold а large amount of life in- 
А Surance -75 
" Бетен а large income 4 
B lave moved up occupationally — .67 
a Live in a large city .63 
8. Supervise people 57 
T.: refer to have many projects “оп 
š ie fire” simultaneously .51 
: $ave a large part of present in- 
come -41 
: Prefer to work with others, 
10 ур than alone .39 
a goi faster than most 39 
+ Goals have been fulfilled or sur- 
12 passed -37 
1s Dislike direct supervision .35 
ii Married early -35 
> College contributed to life's 
15 Eoals 34 
* Receive ample recognition for a 
1g, 8990 job .32 
+ es disturbed by leaving some- 
17, p 28 unfinished 31 
> Parents granted them great in- 
.31 


dependence in high school 


viis characteristic most clearly im- 
ceu Seems to be a high level of 
The HOUR] and financial success. 
Oneg are also understandable over- 
tive d What might be called “execu- 

P ao exemplified in a desire 
ang, mal external control, accept- 


ley. Of responsibility, and a high drive 
Sl at work. 
“ctor ІТ, This second factor, 


8 ‘ch accounts for approximately 
ag of the common factor variance, 
Renee esignated as Sensitive Intelli- 
Tn 5 Again, the item response pat- 
zation Plies the following characteri- 
high scorers on the factor- 


ОТИ | 
1 “terization of Ss Loading 


2. pbset by some tragedy in youth 
ite ognize a number of Alpha 
3. iem from past administrations -48 
1. Те үе write down ideas .42 
5 Jokes frequently 
$vote much time to reading 
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Characterization of Ss Loading 
6. Are sometimes tense and nerv- 


ous .40 
7. Keep some records and budget 
expenses .39 
8. Are quite religious .37 
9. Bring problems from home to 
work .35 
10. Have long job tenure .88 
11. Often feel dissatisfied with them- 
selves .33 


12. Regard recent past as happiest 
period of life 
18. Health has been a social handi- 


cap 

A hypothetical individual making 
these responses evidences above-aver- 
age intelligence in his devotion to 
reading, his superior memory, and 
his production of ideas judged worth 
recording. Coupled with these, how- 
ever, is a syndrome of emotionality 
and sensitivity which may Бе in- 
ferred from a stated youthful trag- 
edy, some tenseness and nervousness, 
some dissatisfaction with self, and 
some perseveration with problems. 

Factor III. The judges described 
and labeled this factor as Physical 
Vigor. It accounts for a little less 
than 8% of the common factor vari- 
ance. Items which load heavily on 
the factor imply the following char- 
acterization of a high-scoring re- 


spondent. 


81 
.30 


Characterization of 8s Loading 


1. Describe their physical vigor as 


above average .70 
2. Say work is not tiring | .49 
3. A typical day does not tire them 
out .47 
4. Have taken advantage of their 
opportunities .46 
5. Concentrate on the technical as- 
pects of the music at a concert .43 
6. Rarely feel dissatisfied with 
themselves .43 
7. Feel more than adequate for 
their jobs .40 
8. Say health has rarely, if ever, 
kept them out of activities .38 
9. Consider themselves more suc- 
cessful than average in things 
attempted E 
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i implied 
Characterization of Ss Loading seems clearly relevant to toa à 
10. Feel they worked up to potential lack of social enjoyment and a 
in college .97 


11. Have rarely had an idea they had 
to write down before forgetting 


it .37 
12. Formed occupational goals early 

in life .35 
13. Feel they don't receive adequate 

recognition for a good job .33 
14. Earned most of their college ex- 

penses :31 


Here, stated physical vigor, work 
attitudes, self-perceptions, and health 
history all suggest the appropriate- 
ness of the assigned name. 

Factor IV. Introversion was the 
label assigned to this fourth factor, 
which accounted for about 6% of the 
common factor variance, The follow- 


ing response pattern defines the fac- 
tor. 


Characterization of Ss Loading 


1. Do not particularly enjoy social 


gatherings 


-50 
2. Do not get together socially with » 
usiness associates 47 
3 Feel People are not all good 45 
4. Lived in a town or City, rather 
than a rural area, prior to college 42 
5. Consider themselyes fairly tense 
or neryous -43 
6. Mother died at an advanced age 
or is still living .42 
Prefer to work on one thing at a 
time 4] 
8. Have experienced major Surgery i 
more times than the average S 38 
9. Health has sometimes inhibited ` 
social activities 38 
10. Are considerably disturbed if ' 
something is left unfinished .34 
11. Usually ignore treatment for 
colds 


y .33 
12. Have turned down social engage. 
ments because they were tired 


.32 

13. Activities have been suspended 

because of accident more often 
than average S .32 

14. Are reluetant to express their 
views in a Eroup situation 31 

15. Have become less religious with 
age .30 


The name assigned to this factor 


ity, somewhat negative attitude to- 
wards others, possibly overlong asso- 
ciation with mother, and sape 
of poor health, low energy level, an 
social reticence. 

Factor V. The fifth and final factor 
was called Egocentric Independence, 
and it accounted for a bit less. than 
6% of the common factor — 
The characterization implied by the 
items of this factor is as follows. 


ing 
Characterization of Ss Load 


1. Consider their own feelings be- 
fore those of others 

2. Not engaged in social welfare 46 
activities : 
Prefer to work alone 


3 
4. Are less religious than the aver- 42 
аре 5 


5. Frequently spend an evening at E 
home in leisurely reading 

6. Consider their reading speed to 41 
be above nverage 

7. Seldom make specific plans .38 


regarding the future 


8. Require eight or more hours of 34 
Sleep 


9. Do not feel their fathers were 
particularly successful 


Life's work has fulfilled or sur- 33 
Passed goals 


11. Have never found it necessary tO 


бер exact financial records OT 31 
udgets 


т 
A very high scorer on ihe ү” 
Would appear to be characterize a 
Some selfishness and solitariness, et 
Preference for working alone, by à 
absence of religious belief, and M e 
feeling that he has done well, P ar 
ably better than his father and be 
an he himself expected. - Ta- 
‘actor-criterion relationships. ce 
le 7 contains the critical арте 
Tegarding relationships of Alpha Jes- 
Ponent gains, representing inte rs; 
tive changes, to life-history ee 
Tepresenting demographic rat 
isties and experiences. Given t the 
data, it is Now in order to recur to 
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"— TABLE 7 
5 
7.20: or Арнал Component GarNs ох Lirr-Hisrony Facrons 
| ма 
А Verbal d i i 
ООЛ | [o mem ns 
Е | Pi P: Pi P: Р: Р Р: Р 
+ Socioe SQ 
ilt Pes economic Success | 52 
ш ee intelligence | 31 i 
lv 1ysical vigor | 5 Б ч е 
ү: p troversion m 
e independ- > 
Т = .50 | .36 .25 
Ote.—P, = 
^ This e = 1919-1950; Pe = 1950-1961. 
was chosen because of the priorly noted unreliability of the component gains. 


Temainj 
riginalty four of the hypotheses 
65, enunciated. 
іоесопо 8 of these asserted that so- 
of Li PRIUS would be a corre- 
Shows intellective change. Table 7 
ference ex V and R component dif- 
ne К es and the total score load 
actor б, nd .23, respectively, on the 
first yos Success during 
кым Реной (1919-1950). 
nfirmed 8 4 may thus be regarded as 
e 
побуда hypothesis specified that 
9 be re} nal factors would be found 
Vill be Wen to intellective change. It 
th tompon aa in Table 7 that the 
the tota] s difference score and 
ї © factor oth load substantially оп 
i ег fo Sensitive Intelligence, the 
th ad both time periods (1919- 
а this ¢ 1950-1961). To the extent 
cra Seness actor involves early trauma, 
sation wi and nervousness, persev- 
c sfaeti ith problems, and some dis- 
rona Gane with self, it has a motiva- 
i Or is ponent. However, since the 
Us, considerably broader than 
Каде д ретт 5 may be safely ге- 
ga lie oat tentatively confirmed. 
heleally 1 hypothesis was that or- 
вар iu conditioned decrements in 
fe per would be found to 
erfor о a relative decrement mm 
mance from 1950 to 1961. 


° 
le а 


It may be noted in Table 7 that the 
numerical component difference score 
did load .35 on the factor Physical 
Vigor, but that it was for the first 
time period (1919-1950) rather than 
the second. The hypothesis, as stated, 
is thus rejected, although the ob- 
served relationship is of interest in its 
own right and the subject of a subse- 
quent speculation. 
The seventh hypothesis was that 
the social introvert would be less dis- 
turbed than his fellows by some essen- 
tial age-related disengagements, ap- 
t during the second time period, 
hould thus perform relatively 
better on Alpha. Here Table 7 does, 
indeed, reveal that the total differ- 
ence score loads .29 on the factor of 
Introversion. The hypothesis, never- 
Id probably be regarded 
oubt for two reasons. 
First, the loading of .29 is not large; 
stands alone. At best, 
tative sort of confirma- 
indicated. 
it should be pointed out 


paren 
and 8 


not anticipated. 
fairly substantial loadings on it, and 


no prior hypothesis, some comments 
regarding the possible significanee of 
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the relationships involved appear in 


the Diseussion section which follows, 


Discussion 
Interpretations 


Some implications of the data pre- 
sented are incompletely documented 
but of possible significance and con- 
cern. 

(1940) 
pointed out that cultura] changes 
might well be mistaken for age 


the implied question so 
Inquire what i 

may be expected to do 
abilities of individu 
culture, Cast in th 


“true” or i 
in the individual, Separate and apart 
from the accompanying ch 

the culture. One of Campbell’ 
quasi-experimental i 
adapted to this purpose. Tt requires 
that one "baseline" the apparent age 
changes noted by measuring a sam- 
ple of present 19-year-olds compara. 
ble to the Ss of this study at their age 
19. Assuming comparable individuals, 
one may then attribute group differ- 
ences observed to cultural change, If 
these differences are subtracted from 


‘The writer is indebted to Campbell for a 
timely suggestion along this line. 
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apparent or observed changes in the 
same individuals over time, the resid- 
uals should represent “true” or in- 
trinsic changes in these individuals. 
To fulfill the indicated purpose Alpha, 
Form 6, was administered during 
1961-62 to a random sample of 101 
male freshmen at the same institu- 
tion attended by Ss of the longitudi- 
nal study, that is, Iowa State Univer- 
sity. 

Table 8 contains the relevant data: 
Column 2 shows total “apparent 
gains from 1919 to 1961. Column 3 
contains as “cultural changes” the 
Scores of the 1961-62 freshmen minus 
those of the freshmen of 1919. Under 
“true gain," by subtest and com- 
ponent, appear the differences be- 
tween Columns 3 and 4. These cor- 
rected component gains compare wit 
the uncorrected total gains of appr ae 
mately 5o on V, —3e on N an 
:2e on R shown in Figure 1. It !$ 
thus, in the case of R that the correc- 
tion introduces the most фаш 
change. The explanation may well г 
fairly simple, The Analogies test, Т, 
example, was probably quite unfam! 
lar to freshmen of 1919, as were ^ 
Stroup tests of intelligence, In 1961 " 
was not unfamiliar to either 19-year- 
olds or 60-year-olds. Thus, what ap 
Deared азд small intrinsic gain in " 
for the 61-year-olds may be better а” 
*ounted for in ternis of а vastly i 
creased genera] level of sophisticati 
With this type of test, in the mide 
of which the senior Ss really 108 
ground in the relative sense. Е 

П апу event, the corrected com 
Ponents seem to be in better accor 
with both theory and practice than 
10 uncorrected. For example, 2 
Tloration i. commonly measured E 
terms of а discrepancy between Ye 
bal and reasoning functions (1026118 
1958); and Cattell (1963) cies: 8 
Surely anticipate an age decrement ng 
‘fluid intelligence” (R-related) lo 


A 
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"— TABLE 8 
матер “TRUE” on Intrinsic CHANGES IN ALPHA Score, 1919 то 1961 
Subt Appa i Mangane 
“ РШ” | дыя 
Subtest Com; 
Practical : | _ ponent 
Упопу Judgment .42 18* 24) 
арр ноута | .50 00 50 
anged в 5 4 j — 
Information sentences «0 Fu = .87**(V) 
Arith | | .97 =, 1.00 
N orien] problems —.34 —.31** —.03 
т Series completion —.0$ .63** = E IB") 
Owing lirecti » j 
nalogies Mirections .05 .03 .02 NT 
Total 16 90" -.74 ЧА) 
.90 Ht 06 
^ К 
» = of 101 male freshmen of 1961 mi M of 9 res 
"LIPS 5 minus М of 96 male freshmen of 1919. 
= P< 0], 
bef 
‚ Ого д " " 
intelligens decrement in “crystallized of males at perceptual problems, and 
ce” (V-related). have observed that it seems to stem 


а matter requiring com- 
Yelationehy possible interpretations 
not nships displayed in Table 7 
Pothese Anticipated in the stated hy- 
tains | 5. The first Р» column con- 
of 7 One of these, a verbal loading 
Penden 0 on the Egocentric Inde- 
Peculate factor. One is tempted to 
facto: € that high scorers on this 
be ese ^ defined by the items, would 
Yolunta Dvolved in social interactions, 
Others Y and involuntarily, than 
Vocabu] so, less breadth of general 
ence ary could well be a conse- 
am. 
Wor, Ond complex of relationships 
table : comment appears in the same 
P Ae. the Numerical Compo- 
һувуы у Opposite Factors III and IV, 
s dence Vigor and Egocentrie Inde- 
4 ers - The question is why high 
im lattes, these factors should tend 
Sw Ws. their N scores more than 
SWer ers on the factors. A possible 
Ma Кы seem to be because they 
А ate the problem more vigor- 
a the context of sex differ- 
» (gd, P'oblem-solving, Witkin et 
) have noted the superiority 


from a more aggressive, vigorous, and 
persistent, attack. If it is true, within 
a sex group, that the possession of 
male characteristics argues for a more 
masculine and a superior performance, 
then it is pertinent to observe that 
physical vigor and egocentric inde- 
pendence are appropriately masculine 
characteristics. 

Still with reference to Table 7, it 
seems both important and appropri- 
ate to note that the factors which 
argue for improvement in V over 
time are not, apparently, the same as 
those which argue for improvement 
in N or R. Indeed, considerable speci- 
ficity would scem to be the rule. 


Evaluation 

Many of the problems attendant 
upon the conduct of an investigation 
such as the present one have been 
priorly dealt with (Owens, 1953) and 
the writer has nothing of substance to 
add. However, two which persist and 
recur should be treated here. 

First, there is the ever-present 
uestion of a bias in the sampling of 
Ss. Of the 127 Ss of 1950, 96 were 
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TABLE 9 
TOTAL AND COMPONENT ALPHA DIFFERENCE 
Scores or Testep AND UNTESTED 
SuniEcrs (1950-1919) 


Component Dr | Du P 
Verbal .631 .679 .02 
Numerical — .059 —.151 -77 
Reasoning .150 —.035 1.99 

Total .583 447 1.49 


Note.—M = 5; SD = 1. For tested Ss, 
N = 96; for untested Ss, N = 31, 
^ None of these values is significant, 


actually retested; 13 were deceased, 
8 refused retesting, 5 were disabled 
by accident or illness, and 5 could not 
be located, Although much of it was 
unavoidable, the total loss was 31 
cases, and the question of how this 


ted the reported 


it will be noted that 

Y is of the order of 
:08о. Thus, had the present bias ex. 
initial Tetesting, this 
would have been its magnitude, This 
is clearly not an estimate of the bias 
introduced into the difference Scores 
of 1961-1950. However, it should 
overestimate the later bias, since gains 
were much larger at the first retest- 
ingthan at the second. 

The second persisting problem js 
one of the extent to which the speed 
requirements of the Alpha test have 
conditioned the results, In anticipa- 
tion of an attempt to evaluate this 
problem, all Ss were given the exam- 
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‘ination on both standard time and 


double time bases. This was done in 
such fashion as to keep the two scores 
independent, and to avoid any altera- 
tion in the prescribed procedure for 
the initial and standard administra- 
tion. The method outlined by Helm- 
stadter and Ortmeyer (1953) was 
then employed to indicate whether or 
not a given subtest was significantly 
speeded, significantly powered, or nel- 
ther. In essence, the method utilizes 
items omitted on standard time as 
an index to speeding, and items wrong 
on double time as an index to power. 
Given these two indices, one may sub- 
tract the latter from the former, 
by S, and generate a single distribu- 
tion of scores to be tested directly tor 
the significance of the departure 0 
its mean from that of a distribution 
with mean zero, 

Table 10 contains a summary om 
sulting from the application of this 
method to the data of both earlier те 
later retestings. It will be noted tha 
there are both more speeded and more 
Powered subtests in 1961 than in 
1950. Of those Which appear for = 
first time jn 1961, Number Serie 


TABLE 10 X 
SPEED Anp Power Inpices ом THE ARM 
ALPHA SUBTESTS 


. | Speed- 
Won | Poe 
Subtest bod “son 
LE 
3. Practical judgment Sg 8 s 
Js nalogies 88 ss 
4 Synonym-antonym 5 
9. Number series comple- 5 
tion 
5. Disarranged sentences P 
2. Arithmetic problems RI PF 
8. In ormation EE 
= :01; 
g 2S8 = Speeded, t test significant at 0H 


Speeded, t test significant at .05. 
PP = Power, ! test significant 2 
Р = Power, / test significant at .05. 


b t 01; 


AGE AND MENTAL ABILITIES 


Completion is speeded, Arithmetical 
roblems is powered, and both load 
on the N component. In addition, the 
Subtest on which the '50 to °61 ap- 
Keng loss was largest, Arithmetical 
ү. огей, became more powered. It 
i Seems difficult to relate changes 
Es Ipha test performance to changes 
n its speed-power composition. 
ju concluding, two incidental im- 
or inns of this study seem note- 
MON First, cross-sectional and 
à Situdinal studies of the effects of 
a on mental abilities have yielded 
pe gent and apparently contradic- 
татын suits to data. Table 8 illus- 
"dea а rationale for their useful and 
Poe ud integration. Second, the 
s Onstration that patterns of liv- 
to S moderate. the relationship of age 
був ental ability implies that cogni- 
ent nes like cognitive develop- 
on tE 18 conditioned to some extent 
тоң lé nature and intensity of envi- 
Mental stimulation. 
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STRUCTURE OF INTELLIGENCE IN NEGRO AND 
WHITE CHILDREN: 


IRA J. SEMLER 


Cedar Rapids Community School 
District, Iowa 


Performance levels on the WIS 
(PM) and the factorial struct 
white and Negro children. WI 
141 white and 134 Neg 
7-, 8-, and 9-yr.-old Ss wer 
Ss had higher WISC FS IQs th 
and higher PM total scores att 
8- and 9-yr. levels. While the 
matrices appeared to be simil 
showed heterogeneous dispersi 


AND 


е also given t 


IRA ISCOE? 
University of Texas 


analysis indicated statistically dissimi- 


te and Negro samples, separate factor 
analyses were computed to exp] 


esearch 


for example, 
extensively (Dre t: Mi a 
Garrett, i947; Si sw, ee 0 
Bright, 1954); h 
for these differe 
controversial topic in the li 

Although Garrett (1947) st ete 
(1958) have offered genetic explana- 
tions for the observed differences, the 
importance of the psychocultural en- 
vironment and social learning in early 
childhood is being stressed by others 
(Dreger & Miller, 1960). It is highly 
probable that changes in economic 
conditions, early intervention with 
innovative educational procedures, and 


! This research was partially supported 
by a grant from the Human Genetics 
Foundation. 

? Special appreciation is expressed to 
Diane DeLancey for computational help 
and preparation of tables and manuscript. 


n ] ore the nature of the differences. The 
ation matrices for whites and 


Negroes were highly 


a 


increased Opportunity to learn will 
produce a different yield from cross- 
cultural studies of intelligence than 
has been reported previously. While 
comparative studies of intelligence are 
hazardous, seriatim measurements 
taken in conjunction with new educa- 
tional procedures instituted for "in 
ferior" children would have some use 
fulness in evaluating the results 0 
changing psychocultural influences: 

. e choice of research instruments 
im comparative studies of intelligence 
18 extremely important since the con- 
structs and Operations defining ^in 
telligence” may vary markedly from 
test to test and may be related 1 
different Ways to the between-race 
effect under Study. Also, the com 
plicated relationships among intelli- 
gence, learning, and the ability to cope 
with environmental conditions are 9 
best unclear, Further, functional intel- 
ligence or the intelligence needed to 
Survive in a certain environment unde" 
Certain conditions involves attributes 
Dot sampled by some tests. For a 
ample, Young and Bright (1954) han 
Suggested that the Wechsler Inte t 
gence Scale for Children (WISC) is r° 
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а for Southern Negro children 
ine they seem to be functioning 
li ay in their environment even 
IQs x they make significantly lower 
thet p^ white children. This suggests 
каше. he mental ability estimates 
É ided by some instruments may be 
un ори for individuals who, for 
Pierii reason, have need for a 
Ee lonal intelligence quite different 
tion that needed by the standardiza- 
ор ae PURO, 'This is no argument 
m a so-called “culture fair" test of 
Mm gence since it is almost impossi- 
o construct and it certainly misses 
+ point in intellectual assessment 
f & Miller, 1960); however, 
Signes need for further comparative 
ih е " designed to explore differences 
р asured intelligence as a function 
psy e. instrument and changing 
c бешш! conditions. 
* principles underlying the con- 
to ation of some tests come closer 
Don: € functional intelligence concept 
diffe Others and may provide a much 
is ent assessment as a consequence. 
ap, харе, the WISC supposedly 
"il a ReAl intelligence but may carry 
non ng loads of memory, drive, and 
Mmtellective factors (Wechsler, 
чека Littell's (1960, р. 144) review 
and ests that the WISC Verbal (V) 
acto :erformance (P) scales are not 
ko orally pure but contain additional 
educti, which might be labeled general 
Parti Ve ability and memory. In 
n cular, the WISC V scale contains 
ait loadings which reflect previous 
1959) ence and education (Cohen, 
Matri, In contrast, the Progressive 
map, S (PM) was based on Spear- 
1947) cognitive principles (Raven, 
Busco and was designed to be less 
lea, Ptible to the influence of previous 
есуу and experience. Since the PM 
reas the ability to observe and 
Ound е relationships within the 
self ;; 21165 of the testing situation it- 
> 1t may provide results more closely 
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related to functional intelligence than 
the WISC. 

The present study extends some 
previous research (Semler & Iscoe, 
1963) which found that the paired- 
associate (PA) learning (pairs of 
objects and colored pictures of these 
same objects) of Southern white and 
Negro children did not differ signifi- 
cantly at the older age levels studied 
(8- and 9-year-olds) despite the signifi- 
cantly lower WISC Full Scale (FS) 
IQs of the Negroes. This study ana- 
lyzes the performance of these same 
white and Negro Ss on the PM at the 
7-, 8-, and 9-year age levels for com- 
parison with the analysis of the effect 
of race on WISC performance. Since 
the PM requires within-test learning 
involving cognitive processes and 
ostensibly does not carry a heavy 
psychocultural loading, it was hypoth- 
esized that between-race performance 
differences would be less on this test 
than on the WISC. Comparing the 
intercorrelation matrices of the white 
and Negro samples for the WISC 
and for the PM was of major import- 
ance in this study. The basic analyses 
tested the similarity of intercorrela- 
tions within each cell of the combined 
white-Negro matrices. Further, multi- 
variate analysis of variance and factor 
analysis procedures were used to test 
the similarity of WISC structure for 
the white and Negro samples. 


METHOD 


Subjects 

The subjects (Ss) used in this investiga- 
tion were 134 Negro and 141 white children 
ranging from 5 to 9 years of age who were en- 
rolled in the public schools and kinder- 
gartens in Austin, Texas.’ At the time of the 
study, these schools were still segregated as 


ae 

s Appreciation is expressed to the ad- 
ministrative officials of the Austin Public 
School System, Texas, for their cooperation 
in conducting this study. 
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a result of race differences in housing loca- 
tion rather than school policy. 


Procedure 


Each child had been administered the 
WISC by a trained examiner as part of a 
study of race differences in learning ability 
(Semler & Iscoe, 1963). The five WISC у 
subtests given were: Information (I), Arith- 
metic (A), Similarities (S), Vocabulary 
(Voc), and Digit Span (DS), to provide a 
WISC V IQ. The four WISC P subtests 
given were: Picture Completion (PC), 
Block Design (BD), Object Assembly (0A), 
and Coding (C), which were prorated to give 
a WISC P IQ. All 275 Ss were tested for this 
portion of the study. 

Partly as a result of finding that white 
children had higher WISC FS IQs compared 
with Negro children at all age levels in the 
previous study (Semler & Iscoe, 1963) and 
to provide comparative data on a less cul- 
turally loaded lest, sets A, AB, and B of the 
Coloured PM (Raven, 1956) were adminis- 
tered as group (classroom) tests to tho 160 
ee were 7, 8, and 9 years of age. The 5- 

administered the 


b ility to handl 
test in a group setting. к UM 


Statistical Design 


In the second phase all i 
order correlations within and Босае, at? 
WISC and PM subtests and totals were а 
puted separately for the Negro and white 
samples without regard to age. The result. 
ing white and Negro intercorrelation mat- 
rices were combined with the appropriate 
parts of the WISC standardization Popula- 
tion intercorrelation matrix for the 7M. 
year-old level (Wechsler, 1949, р. 10, Table 
IV) for comparison. Each cell of the com- 
bined matrix contained a set of three inter- 
correlations which were tested for ho- 
mogeneity (Edwards, 1960) to determine 
within-set intercorrelation similarity. This 
type of analysis also allowed separate tests 
of the homogeneity of pairs of intercorrela- 
tions within sets, that is, for white versus 
Negro, white versus WISC standardization 
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population, and Negro versus WISC stand- 
ardization population intercorrelation pairs. 
The same type of analysis was used to test 
the homogeneity of white-Negro intercorre- 
lation sets among PM subtests A, AB, and 
B and the correlations between the WISC 
and the PM subtests and totals. 

The third phase compared the structure 
of WISC intelligence for white and Negro 
Children with a multivariate analysis of 


variance and with exploratory factor anal- 
yses.* 


RESULTS 


Race by Age Proportional Analysis of 
Variance оў WISC FS I Qs 


The means and standard deviations 
of WISC FS IQs by sex for white and 
Negro children at five age levels are 
presented in Table 1. A race by age 
Proportional analysis of variance was 
computed after randomly discarding 
Ss to equalize cell numbers (N = 24). 
The highly significant race difference 
obtained in favor of white children 
(Р = 6489, df = 1/230, p < .001) 
was supported at each of the age levels 
by an analysis of the simple effects 
using the mean square within-sub- 
groups as the error term. The age 
effect was significant (F = 6.98, af a 
4/230, p < 001), indicating differen 
mean WISC FS IQs for the combing 
racial groups at the various age levels 
analyzed. The significant Race X A8° 


^ The help and advice of H. P. Bechtoldt, 

University of Iowa, in carrying out t ae 
phase of the analysis is gratefully acknow 
edged, Bechtoldt/s modification and аер, 
810 of the multivariate analysis of yore 
(MULTAN) program of Cooley and Lohn 
(1962) was used to test the hypotheses КА 
Constant variances and constant aa 
110—5, homogeneity of dispersion, 2 т 

omogeneity of mean vectors. Further 
Bechtoldt’s comprehensive factor-analy ta 
(FACTAN) program was used to explore > 
structure of WISC intelligence for white # 
"Negro children separately. The MULT m- 
and FACTAN programs are described Bom- 
pletely in the User's Guide, University Со 
puter Center, University of Iowa. 
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TABLE 1 
Means anp Stannard Deviations OF WISC 
FS IQs ror Warre (N = 141) AND 
Necro (№ = 134) CHILDREN 
AT FIVE АСЕ LEVELS 


fs Age in years 
s Total 
5 6 | 7 8 9 
White 
Boys 
x 19 2 |12 |i | 12 80 
a 110.5 | 104.2] 97.8 | 105.1 98.6 
ой 1.5 | 15.0 12:2 | 15.6 | 13.9 
7 15 |15 | 12 | 12 61 
М 116.6 | 104.9] 97.3 | 105.8 | 94.3 
" 5 6:3] 7.7] 15.2 |16.4 
egr 
Boys 
N 
13 м |12 | | 12 т 
М 83.5 | 02.0 90.7 | 98.3 | 89.6 
Ging | 13:1 | 11-3] 81] 9.04) 7.5 
ў 13 pim |н {| |е 
be 82.5 | 103.2] 90.8 | 96.6 | 85.1 
13:94] 7.2] 12.3 | 11.2 | 11.3 
See ОО Nee a S 


p action (F = 8.79, df = 4/230, 
dime 001) indicated that the race 
he епсев in WISC FS IQs found in 
the Simple effects analysis were not 


е s 
studied at each of the age levels 


Wise Intercorrelations: Comparison of 
as and Negro Samples and WISC 
ndardization Population 


wi Possible linear correlations among 
IQ Subtest scaled scores and V IQ, 
White and FS IQ are reported for the 
Wisc and Negro samples and the 
Tabi Standardization population in 
ach 2 in combined matrix form. 
tain cell in the combined matrix con- 
One г а set of three intercorrelations, 
тоц Ог each of the three comparison 
ще Ps. The set of intercorrelations 
ach cell was tested for homogeneity 
test, A transformations were used to 
al e statistical difference between 
Within sible pairs of intercorrelations 
On © each of the heterogeneous sets. 
соу 000 set of WISC V subtest inter- 
Can: ations, A versus Voc, was signifi- 
Y heterogeneous (x? = 10.33, 
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df = 2,p < .01) with a significantly 
higher (z = 3.03, p < .005) relation- 
ship for white Ss (r = .52) than 
Negroes (r — .21). A most striking 
finding was that all WISC P subtest 
intercorrelation sets were homogeneous 
even though slight differences between 
correlations within sets are evident. 
That part of the WISC structure 
defined by the intercorrelations be- 
tween V and P subtests showed that 
five of the 20 intercorrelation sets 
exhibited significant heterogeneity. 
Three of the heterogeneous sets in- 
volved A with BD (x? = 9.47, df = 
2, p < .01), A with ОА (x? = 8.24, 
df = 2, p < 01), and A with С (x? = 
12.86, df = 2, p < 01) with signifi- 
cantly higher intercorrelations for 
white Ss than Negroes. The S versus 
PC intercorrelation set was hetero- 
geneous (x? = 6.65, df = 2,p < .05) 
due to the higher (z = 2.57, p < .01) 
correlation for Negroes (r — 51) 
compared to the WISC standardiza- 
tion population (r = 127). 

Of further interest are the part- 
whole correlation comparisons be- 
tween WISC V subtests and V IQ 
for the three groups. In three sets 
(V IQ versus S, Voc, and DS) there 
were no white-Negro differences and 
heterogeneity was due to the signifi- 
cantly lower correlations of the WISC 
standardization population. In another 
set, V IQ versus A, the white correla- 
tion (r = .73) contributed significantly 
GB = 9.13, df = 2, P « .05) to set 
heterogeneity with no difference be- 
tween the Negro (r = .57) and WISC 
standardization population (r = .55) 
correlations. All of the part-whole 
correlation sets between WISC P 
subtests and P IQ were heterogeneous 
due to the significantly higher white 
and Negro correlations. In the P IQ 
versus А set, heterogeneity (x? = 
12.36, df = 2, < .01) was due to the 
significantly lower intercorrelation of 
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INTERCORRELATIONS AMONG WISC SUBTEST ScaLep 


TABLE 2 


Waite (N = 141) AND NEGRO (N 


Ira J. SEMLER AND Ira Іѕсов 


= 134) CuinpnEN AND WISC 


Scores, V IQ, P IQ, anp FS IQ ron 


STANDARDIZATION POPULATION 
WISC Subtests Ss А | 8 | Уос! ps| рс | вр | ол | c [viol pro lrsi0 
Information White 44) 48 -59 | .31.832 | 35 |.93 | 1r |.77 | ав 1.04 
(D Negro 42 47) .42 | 28) 44 | 29 [97 .82 |.71 | .45 |.66 
WISC -51 .49) .55 | 39.94 | 33[ о7 -26 |.64 | .44 |.59 
Arithmetic White 30) .52*| 421.30 | .53*| 4le*| 409 тз**| 56*/.70* 
(A) Negro "32 -21 | .23/.23 | :28 оо | ‘on [sz | 21 |.45 
WISC :40/.46 | .40.29 | 27 |-99 | /32 |.55 | 146 1.57 
Similarities White 49 | .22).36**) ов | ор | 12 | z9* | .35 |.59 
(S) Negro 55.32.51 | 18 [11 | 220 |777 | 135 |.64 
WISC :45 | .34.27 | .29 |.29 .15 |.55 | .41 |.53 
Vocabulary White .38 ** *|.52 |.74 
imum Морг 38.41 +35 |.42**| 96 | 82 e 
uis :90.43 | .24 .15 | 98 |74 |. 5 
88.36 |.33 |.30 | 22 [66 | .47 |.6 
Digit Span White Р 57* 
10 | .42 |.36 | .97 |.63**| .39 |. 
1 
on” пара -27 | .23 [93 | 312 |60 |30 b 
-23 |.20 13 | 24 |44 | .29 |.4 
Pictu i 
tion Comple waite 132 37 | 18 ar | nest 
(PC) Wise -29 |.43 | .20 |.53 | .70 43 
28 |.28 | .12 |.42 | .34 
Bl i 
ock Design White 53 | 40.51 | .75*.71"* 
egro 63 |.53 
Wise 40 | .21}.30 |. 52 
53 | .26 |.42 | .53]. 
Object Assembly Whi 
(0A) "| Moa 26 44 | 78°05 
WISC 32 1.28 |.80 «E 
-30 |.88 | .59 |. 
Coding Whit 
(C) carp Re 1 A 
WISC “31 | 32.35 
Verbal IQ White 9 |.89 
(V IQ) Negro 5 ‘87 
WISC 60 |.90 
Performance IQ | White 89 
(P IQ) Negro 87 
WISC 89 
Full Seale IQ White 
(FS IQ) Negro 
WISC 
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Negroes Ge = 
Whites (r = 56). 


Wrsc 

of TAC Structures Multivariate Analysis 
^arianc Penton г à 

Analyses e and Exploratory Factor 


we iud be noted that the WISC 
eid designed for factor analysis; 
made ^ an exploratory study was 
the WISG rovide comparative data on 
Negro ы structure for the white and 
inaya Pes: Prior to the factor 
Variance 8 multivariate analysis of 
hypoth SON as computed to evaluate the 
Sent s that these data might repre- 
rom independent random samples 
Populati Single multivariate normal 
3 of on with reference to the two 
канн. means and the two vari- 
Mig S matrices. If this hy- 
respect 15 tenable at least with 
atrice to the variance-covariance 
single h (or dispersion matrices), а 
Wor, actor analysis based on the 
ix = within-groups correlation ma- 
ен d be indicated (Cooley & 
homoge 1962). The hypothesis of 
t ap eous dispersions was not 
< CAE = 1.99, df = 45/24537 436, 
the 001). Further, the hypothesis of 
equals 108 епену of mean vectors 
Was n Y of population centroids) 
9/9619 supported (Р = 14.99, df = 
Sviden Р < .001). Sufficient statistical 
Cate th, existed at this point to indi- 
Strug oa dissimilar WISC factor 
White Tes would be expected for the 

ee Negro samples. 
on a single factor analysis based 
tion © Average within-group correla- 
rin is contraindicated, two 
Werg - exploratory factor analyses 
Sie using the maximum likeli- 
Bn Varimax rotation (Kaiser, 
өр ееп Ocedures, The hypothesis of 
: ata. 166 Was refuted for each set 
( a the tests for this hypothesis 
E. апсат for the white data 
9.91, df = 36, p < .001) and 


.21) compared with 
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for the Negro data (x? = 291.74, 
df — 30, p « .001). It follows from 
these statistical tests that a separate 
factor analysis for each set of data 
could be justified. The maximum 
likelihood procedure applied to the 
data from the 141 white Ss indicated 
only three significant factors from the 
test of the third factor residuals 
(х? = 12.63, df = 12). The data for 
the 134 Negro Ss, however, showed five 
significant factors which appeared to 
be a function of the program procedure 
in which the early factor values were 
not stabilized. In a rerun with the 
program limited to three factors, the 
results indeed stabilized with the 
second factor residuals just short of 
significance at the .05 level. The third 
run, reported here, was then made in 
order to produce results more com- 
parable to those of the white sample. 
The third factor residuals were clearly 
not significant (x? = 12.22, df = 12). 
The factor matrices for whites and 
Negroes are shown in Table 3. The 
comparability of factoring is indicated 
in part by the nearly equal properties 
of the total variance; the sums of the 
communalities over nine tests were 
4.648 and 4.657 as shown on the bottom 
line of Table 3 and the proportion of 
total variance accounted for was 
51.64% and 51.74% for the white and 
Negro samples, respectively. Only the 
Varimax rotation, as shown in Table 
4, was made for the white and Negro 
samples, although an inspection of the 
graphical plots of each set of data 
indicated that an oblique structure 
would provide а better fit. However, 
in view of the inappropriateness of the 
nine WISC subtests for a factor analy- 
he forcing procedure used to 
factor the Negro data, further trans- 
formations could hardly be justified. 
Rationalizing the extracted factors 
by inferring sets of psychological 
abilities from inspection of the WISC 


sis and t 
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TABLE 3 
Maximum LIKELIHOOD FACTOR MATRICES FOR Warre (N = 141) AND 


Necro (N = 134) CHILDREN 
White Negro 
WISC Subtests Factors Factors 
I п ш ht I п ш Ld 
437 
i .639 .331 | —.167 .945 .641 .139 | —.085 * 6 
ee "730 | —.137 | —.178 | ‘390 | “465 .381 | —.797 p 
Similarities .536 -340 | —.101 | .413 .690 .340 .167 "519 
Vocabulary -748 313 | —.004 | 658 .643 .233 228 C 
Digit Span Total 535 | —.157 | —.052 | 313 | “45g -046 | —.002 | -213 
Picture Completion .483 .162 -120 | .274| .694| — .054 086 de 
Block Design -669 | —.333 .001 .558 -429 | —.162 | —.178 de 
Object Assembly -673 | — 244 687 | .984| .572| — 77g] — 145l 185 
Coding -440 | —.307 | —.155 -312 | .378 | —.156 .133 4.057 
Za? 3.408 |. .658 | 582 4.648 | 2.862 -995 .800 | 4. 


subtests exhibiting nonz 


ero loadings 
has little meaning in thi 


S Study since 


Suggesting similar clus. 
mprehension abilities, 
C OA subtest loaded 
ite sample and 


Н organize visually 
Perceived materials, the Negro factor 
TABLE 4 
Varimax ROTATION Facror Matrices FOR 
Ware (N = 141) Anp NEGRo 
(N = 134) CHILDREN 

White Negro 

кекс = 

WISC Subtests Factors Factors 
I| ut | ps 
Information -227| .314|.437 
Arithmetic 033| 978/096 
Similarities +048) .171].020 
Vocabulary К -105| .066|.519 
Digit Span Тош +199} 11541211 

icture Com- 

letion -394| .112].499 
Block Design -385| .233|.249 
VPE ваз 2978] .041|.053 
Coding -309|— .058/. 185 


Was quite complex due to the inn 
of indeterminant factor loadings. O n 

the WISC A subtest loaded iat z 
on Factor III for the Negro s 
eaving a “clean” factor represi 
that aspect of verbally retained o 
edge gained only from formal edu X 
tion. Factor II for the white sample W D 

2 complex factor involving loadings r 
the WISC A, Voc, PC, and C subtes 


Race by Age Proportional Analysis Ф 
Variance of PM Total Scores 


Table 5 presents the means a 
standard deviations of PM total scor 
(sum of scores on sets A, AB, and en 
by sex for white and Negro cular 
at the T-, 8- and 9-year age psa 
As with the WISC, a proportional po 
ysis of variance was carried out "lize 
randomly discarding Ss to equ ain 
cell numbers (ү 24). The 7138, 
effects of race (Ё = 444, df = ү = 
P < .05) and age (F = 20.64, th 
2/38, p < 001) were signifi age 
Analysis of the simple effects for in 
indicated a significant irme en 
PM total Scores favoring white chi а = 
at the 7-year-old level (1 = 1.99, d9- 
50, p < -05) but not at the 8- carole 
year-old levels. While the 7-ye& 
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"T TABLE 5 
ae Sranparp Deviarions or PM 
о CORES FOR Warre (№ = 78) 
Necro (N = 82) CHILDREN 
BY Sex АТ THREE 
AGE LEVELS 


Ss Age in years 
Total 
White pA 
Boys 
N 
12 15 12 39 
M | 15.83 95.20 |2142 
Ging | 8:52] 8-18 | 5.88 
N 
и 1215 m 12 39 
i 15.00 |22.33 | 22.08 
8.16 | 5.34 | 5.14 
Negro 
Boys 
N 
и |2 |20 12 44 
с 9.42 |21.85 | 21.75 
Ging | 9-53 | 5.19 | 5.86 
N 
м 112 и 12 38 
е 12.07 [20.57 | 19.50 


И 

differen uted to the significant race 

total ees reported above for PM 

action cores, the Race X Age inter- 
was not significant. 


M 
What Intercorrelations: Comparison of 
e and N 
egro Samples 


aes a 
on pe, intereorrelations among scores 
White а, sets A, AB, B, and total for 
3 al nd Negro children are reported 
Were © 6. All intercorrelation sets 
Patt who geneous. However, one 
ец Ole correlation set, РМ АВ 
(его total, was significantly 
With E*heous (z = 3.39, р < .00D 
ts eet correlation for whites 
Ln ) than for Negroes (r = .80). 


err, 
Солрдунейлевв of WISC and PM: 
les Son of White and Negro Sam- 


Tab 
betwee 7 presents the correlations 
the WISC subtests and IQs 
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with PM sets A, AB, B, and total for 
white and Negro Ss. In general, cor- 
relations were small with set hetero- 
geneity evidenced for PM AB versus 
WISC I, A, and V IQ where the cor- 
relations were signifieantly higher for 
white Ss than for Negroes. The cor- 
relations between WISC FS IQ and 
PM total scores for whites (r — .47) 
and Negroes (r — .34) were not sur- 
prising in view of the many reasons 
for a moderate or even low relation- 
ship. It should be noted, however, that 
previous studies (Barratt, 1956; 
Martin & Weichers, 1954; Stacey & 
Carleton, 1955) have reported gen- 
erally higher correlations between the 


WISC and the PM. 
DISCUSSION 


One of the guiding hypotheses in 
this study was that white-Negro 
differences in level of measured intelli- 
gence would be less on a test involving 
cognitive abilities than on a test loaded 
with psychocultural factors and this 
was partially borne out. Significantly 
lower WISC FS IQs were obtained for 
the Negro children compared with 


TABLE 6 
INTERCORRELATIONS AMONG PM Sets A, 
АВ, B anp TOTAL ScoRES FOR 
Warre (N = 78) AND NEGRO 
(N = 82) CHILDREN 


PM Sets Ss A | AB B | Total 
A White .64 | .54 | .80 
Negro .49 | .46 | .75 
AB White .65 | .91* 
Negro .57 | .80 
B White .86 
Negro .86 
Total | White 
Negro 


ж < 01 that the set of two r's tabled 
in the cell are estimates of the same popu- 


lation. 
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TABLE 7 
У VEEN SUBTESTS 
RELATIONS BETWEEN WISC 
Eo IQs anp PM SunTESTS AND TOTAL 
Scores ron Wurre (№ = 78) 
AND Necro (N = 82) 


CHILDREN 
PM 
WISC бз e 
^ | AB| B | Total 
Information White | .27 | .28**| оу 28 


Negro | 07/01 | 207 | <io 


ithmeti White | .20|.29* | з 
AriMatietio Negro | 19| | 739 a 
Similariti White | .29] 40 | 38| ао 
Similarities кА 3/2 | s 
Vocabulary White | .29 | .34 19 


Negro | .21| 23 | 28| 73 


White | 17] .19 B] В 
Negro| .07 .08 .10 14 


Digit Span Total 


Picture Comple- 
tion White | .23 |. 30 .26 E 
Negro | .20 +23 ч +23 
White | .37 E +33 . 
Negro | .22 | | 28 -30 -33 
White | 26 +30 -26 +32 
egro| .12 | .26 -18 +23 


White | 09 ч -16 15 
Negro |—:08 -00 O01 


White | .33 | 40° -33 .42 
Negro | 125 -19 27 31 


Block Design 
Object Assembly 
Coding 

Verbal IQ 


Performance IQ 


(prorated) White | 33 | ао .35 42 
Negro | <18 | ‘99 23 -30 

Full Scale IQ White | .37 | 55 -38 E 
Negro | 124 | ‘97 .29 ES 


of two 
Pulatio; 


v's tabled in the cell are 
white children at each of the age levels 
studied; however, no significant differ- 
ences were obtained on PM total 
scores for these same Negro and white 
children at the 8- and 9-year age levels. 
Significantly lower PM total scores 
were found for Negroes at the 7-year- 
old level suggesting that age and/or 
experiential factors related to age may 
be influential in cognitive develop- 
ment. The findings may be explained 
partially by the differential demands 
of the two instruments. WISC per- 
formance is influenced by verbal com- 
prehension and experience with verbal 
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manipulation, the ability to tolerate 
distraction including time pesar: 
and previous experience with eee ae 
designed to develop manipulative s a 
and perceptual organization. In i 
trast to the Negro children, these w i 
Ss had ample opportunity for the Y 
development of verbal skills and Ax 
acquisition of a fund of social шит” 
Шоп as well as early life NET 
with a variety of play materiali, аз 
of which contribute to the deve Led 
ment of those mental abilities ae 
by the WISC. On the other hand, si 
PM was designed to tap those bin 
tive abilities which are less ai е 
to individual differences jn prior 
experience and learning and the Ne- 
vironmental background of the a 
groes did not operate to reduce P аё 
formance on this test as was the ca: 
on the WISC, the 
In view of the finding that ani 
performance of the older Negro it 
white Ss did not differ on the P? S oi 
might be useful to point to qoe 
Tesearch findings (Semler & ir 
1963) showing that the PA lear n 
of these same 8- and 9-year-old pn^ r 
and white children did not Gr 
despite Significantly lower WIS! rials 
Qs for the Negroes. The PA mate the 
Were objects commonly found in rd- 
environment of most children “en 
less of socioeconomic condition. ain 
PA task required Ss to associate T ese 
of objeets or colored pictures of - А 
Objects by the anticipation me vere 
The object and Picture pairs Y ай 
arranged іп conceptually Bonet п 
dissimilar sets and required S to р? 
new associative responses. The the 
also involves new learning imn, 
testing situation by requiring terns 
observe the relationship of pat ke 2 
within a perceptual field and ane a- 
Correct association. This ИШЕ ЕГИ 
Чоп depends, of course, on the - i 
similarity of the abilities requir 
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pde situations but is offered as 
telaio: ce for future research on the 
ңе nship between learning and 
msn intelligence. 

lc ae, white-Negro inter- 
e : ion matrices for the WISC 
iin m ike PM provided the basic 
s. the structure-of-intelligence 
ibit ee The white-Negro PM 
a intercorrelation matrices were 
faga Шар, suggesting that the 
cogniti ationships among the stages of 
окы уе development sampled by this 
E quite similar for the two races; 
subjected this assumption was not 
о th ed to further analyses due 
Furthe small number of subtests. 
Ve м research comparing the de- 
groups ent of cognitive abilities for 
"io with varying psychocultural 

he ounds seems promising. 
ation white-Negro WISC intercorre- 
With уана appeared to be similar 
Corre] "^: exception of the lower inter- 
or the 808 involving the А subtest 
u бы Negroes. Performance on this 
M xa directly related to verbally 
throu h knowledge learned only 
ады formal education. The multi- 
© analysis of variance indicated 
Struc statistically dissimilar WISC 
for fhe e existed for the whites and 
tory € Negroes and separate explora- 
a engl analyses were carried out. 
Subtest be noted that the WISC A 
tor for was practically a unitary fac- 
Whites the Negroes but not for the 
ter or Interestingly enough, the clus- 
Was ы verbal comprehension abilities 
facto, oilar for both races. Unique 
Used ;analytie procedures have been 
ang 0" the WISC with both white 
1964у 870 children (Osborne, 1963, 
Droh, em however, special statistical 
Sons RE arise when group compari- 
designed made and the WISC is not 
"ако ар for this purpose. It seems 
% Dar. € to suggest that future 
ative factor-analytic research 
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eould well explore separately those 
domains of intellectual abilities the- 
orized to be influenced by psycho- 
eultural variables and formal training. 
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DISCRIMINABILITY OF WORDS, AND LETTER CUES 
USED IN LEARNING TO READ! 


S. J. SAMUELS? anD 


W. E. JEFFREY 


University of California, Los Angeles 


The 36 kindergarten Ss were distributed among 3 groups. Using the 
paired-associate anticipation method, each group was taught 1 of 3 


lists of words that differed in diser: 
determined by the number of differ 


iminability. Discriminability was 
ent letters (either 4, 6, or 8) used 


to construct 4 2-letter words. It was hypothesized that the more dis- 
criminable the list, the faster the learning rate but the greater the 


probability that Ss would learn on t 


he basis of single letters. The hy- 


pothesis concerning learning rate was confirmed with p < .01. After 
the 4- and 8-letter groups were brought to the same criterion, the 8- 
letter group was found to identify words on the basis of single letter 


cues significantly more often (p < 


.02) than the 4-letter group, thus 


confirming the 2nd hypothesis as well. 


Similarity among stimuli has not 
en shown to retard paired-associate 
oe (Underwood 1953a, 1953b) as 
Pr eo as one might expect from notions 
© the effect, of stimulus generalization 
1 learning (Gibson, 1940). This may 
s due to the fact that paired-associate 
aiming is typically done with adult 
Ubjects (Ss), and even though low- 
meaningfulness nonsense syllables may 
sti used (Underwood, 1953a), the 
wen units (letters) are already 
i discriminated. If this were true, 
8h-stimulus similarity in paired- 
Sociate learning with children should 
chil uce greater difficulty in that 
the um would be less familiar with 
ity etters, and thus increased similar- 
tho Might produce results similar to 
none Obtained by Gibson (1941) using 
Sense figures with adult Ss. 
k Ost early reading texts select words 
the basis of their frequency of usage. 
chil Increases the likelihood that the 
Word 1s familiar with the sound of the 
fu]; d that the sound is meaning- 
Out 7°02 as Silberman (1964) points 
be ^, the high-frequency words tend to 
"regular and to differ markedly 


1 H 
Sta Tis research was supported by United 
to th 3 Public Health Service Grant 07937 
249 Second author. 
OW at the University of Minnesota. 
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from each other in configuration and 
thus to be quite dissimilar. Further- 
more, it is not unlikely that high 
similarity is purposely avoided as a 
potential source of confusion for the 
child. Although such procedures for 
word selection in teaching reading may 
facilitate initial learning, effective 
reading training should also provide 
the ability to transfer from the initial 
training such that the reader may emit 
correct sounds to new and unfamiliar 
combinations of letters. The extent to 
which this happens depends on what 
becomes the stimulus in the initial 
paired-associate learning situation. An- 
derson and Dearborn (1952, p. 215) 
suggest that “what children mainly 
seem to have an eye for is some detail 
of the word, a single letter or two 
which comes to stand for the whole 
word...." Marchbanks and Levin 
(1965) provide support for this proposi- 
tion in that they find that letters were 
used more often than word shape as a 
cue for recognition. If a single letter 
may serve as a cue for the whole word 
response, then rapid initial learning 
may indeed be a false economy. When 
a child provides a response to a new 
configuration of letters on the basis of a 
single letter cue, he may not even 
recognize that he has made an error 
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until it becomes obvious from reading 
the rest of the sentence. 

In a study that compared the rela- 
tive transfer effects of word and letter 
training (Jeffrey & Samuels, in press) 
it was necessary to use common letters 
in each of two words in order to assure 
that the children utilized each specific 
letter. The results suggested that four 
words constructed of four letters were 
harder to learn than four words con- 
structed of six letters, that is, learning 
was retarded by similarity. The present 
study was designed to test the relation- 
ship between the rate of learning and 
the number of letters, and, in addition, 
to evaluate the degree to which transfer 


e more likely to 
new words on the basis of a 


n they are forced 


; transfer would of 
course be very poor, 


a Santa Monj 
school. The median age of both 
5 years 5 months, 


a nursery 
Eroups was 


Material for Learning Reading Re- 
sponses 


An artificial alphabet was created that 
had as little resemblance to English letters 
as possible. The responses to be associated 
with the letters were common English words 
assumed to be part of the Ss' repertoire 
(Figure 1). Three lists of two-letter words 
were constructed. List 4-L had four two- 
letter words constructed from only four 
different letters, List 6-L had four two-letter 
words constructed from six different letters. 
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Group R-L 


Words Pronounced 
el DA 
ts їй 2 
il tA ul 
хй Xs ш 


Fic. 1. Symbols used in each of three dif- 
ferent lists and their pronunciations. 


List 8.L had four two-letter words ора. 
structed from eight different letters. т 
Words were printed on 5 x 8-inch in i. 
cards. These were presented serially td 
hand with an approximate 4-second pep 
interval. Each word list was presented 
three orders, 


Material for Test of Cues Used ™ 
Learning Reading Responses 


On each test card one of the artificial 
letters which S had seen in his аы! 
training was now replaced with a letter en 
he had never seen before. In other n 
the words were presented again, but this Sie 
with a new letter substituted for one of 4- 
letters in each word. This means Group 8-1 
Was presented with 4 words, and Group tter 
was presented with 8 words. The new a 
appeared half of the time in the first posi n 50 
and half of the time in the second posae 
that each original appeared once БЕРИ 
bination with the new letter and the ФЙ Sic 
etter always appeared in its original Pi 
tion. Thus, if § had learned to respon the 
Eoy of the words in the original list онсе 
basis of only one letter, he should not ee 
a change in at least half of these new Pha 
and he should provide the response he 


7 vi ut 
Previously learned to that letter witho 
hesitation. 


0, 
Procedure for Evaluating Effect / 
umber of Words on A cquisition 


0 
Twelve Ss were randomly assignee 8, 
each of the three different training nta 
Each S was brought into the experts iy 
Toom and told that he was going to Pid 10 
Бате. After he was sented, he was Men in 
select a small plastic toy, to put the ‘back 
his desk, and that he could take the toy in 
io his room when he was finished prt 
the game. He was then shown the s as- 
words in the list, told the word to to 
Sociated with the letters, and as ation: 
Tepeat the word after each presento 18 
© was then told that for the rest 


DIscRIMINABILITY OF CUES IN LEARNING TO READ 


tria 
m е must try to say the correct word 
hu ы 1e experimenter said the word. 
" жа the learning group were given 
Cases Jn of the list and each correct 
was recorded. 


P. А 
mai Jor the Transfer Test of Cues 
ed in Learning Reading Responses 


qa structions were the same as for the 
to euch ie DS. Twelve Ss were assigned 
Was train € groups for training. One group 
L. Bane on Set 4-L, and the other on Set 
of two pis were trained to n criterion 
first 18 ee ide correct trials. For the 
a conna ЦЕ pairs were presented in 
Sübsequer apr order. On the 19th and all 
word Dai m trials, the order in which the 
This s irs were presented was randomized. 
ү procedure facilitated learning mark- 
Unos БЕ шашу, with the four-letter list. 
i E criterion, each S was shown 
you like por serially, and asked, “Would 
Tesponge s me what word this is?" His 
resented. as recorded and the next card 


" RESULTS 
tx the acquisition groups the mean 
20 {+ er of correct reading responses In 
Rigo db was 24.00 for Group 4-L, 
Tou for Group 6-L, and 54.00 for 
Cates 8-1. Analysis of variance indi- 
19.05 a significant difference (F = 
Comp; of = 33, р < .01). Specific 
Gro ürisons between the means of 
"Oups 4-L and 6-L, 4-1, and 8-L, and 
Drogen 8-L, using the Newman-Keuls 
indicata (Winer, 1962, pp. 80-85), 
Signi ed all of these differences to be 

cant (р < 01). 

test ^ two groups used for the transfer 
fhe p brought to criterion before 
qu ie | The technique of first present- 
0, d tia 5 ша constant serial order 
мы Tria is proved to be very efficient. 
29 Group 4-L had learned a 
ed a m, Balis and Group 8-L had 
«trial E of 3.2 pairs. Following 
°З ду ized, G quence, the pairs re 
trials 4 3roup 4-L took a mean o 
ean of о criterion and Group 8-L 
al | 21.16 trials including the 


trials of geria order 
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training. The performance on the 
transfer test was as predicted. Group 
S-L made a significantly larger number 
of false identifications than did Group 
4-L (too = 2.60, p < .02). Ten of 12 Ss 
in Group 8-L made one or more false 
identifications on the transfer test as 
compared with only four of 12 Ss in 


Group 4-L. 


Discussion 


The hypothesis that stimulus simi- 
larity would prove to be a relatively 
important factor in paired-associate 
learning, if the stimuli are unfamiliar 
and children are used as Ss, received 
strong support although it was also 
shown that with specialized training 
(constant serial-order), the effect of 
stimulus similarity on learning was 
reduced considerably. The results 
provide clear evidence that the number 
of Ss who make identifications on the 
basis of a single letter increases with 
the number of letters on which they 
were trained, that is, increases with 
dissimilarity of stimuli. Thus, as 
hypothesized, training that forces at- 
tention to each letter is less likely to 
lead to subsequent reading errors than 
training whieh permits the child to 
identify words on the basis of a single 
feature, In an earlier study, Jeffrey 
and Samuels (1965) found that when 
children are given letter training first, 
Ss had much less difficulty identifying 
the words in List 4-L than Ss which 
were given word training with the 
words in List 6-L prior to learning List 
4.L. In combination, then, these 
studies suggest that, in teaching chil- 
dren to read, it is important that the 
child learns to attend to all of the 
letters. This can be accomplished by 
either specific letter training or it may 
be accomplished by giving them experi- 

hly similar words which 


ce with highly 
fun identification on the basis of 
a single stimulus feature. 


more than 
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Before accepting this generalization 
uncritically, it should be noted that 
the training situation used a highly 
restricted and unusual alphabet, and 
each child was trained individually by 
the experimenter. Furthermore, a small 
tangible incentive was provided to 
assure that S would be cooperative, 
and would remain in the experimental 
situation. 
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SIX STRUCTURE-OF-INTELLECT HYPOTHESES 
IN SIX-YEAR-OLD CHILDREN: 


di А К. E. ORPET 
valifornia State College at Long Beach 


AND 


C. E. MEYERS 
University of Southern California 


й 20-test battery hypothesizing 6 Guilford structure-of-intellect cells 
vas administered to 100 6-yr.-old kindergarten children. Principal com- 
ponent factors were given several varimax rotations. Hypotheses 


well sustained were memory for 


symbolic units (MSU), convergent 


semantic production (NM), divergent production of semantic units 
(DMU), and evaluation of figural units (EFU). A tentative factor was 
БОШО for symbolic units (MFU). Tests for an evaluation of sym- 
olic units (ESU) factor loaded on EFU, suggesting that reaction to 
visual language symbols is still perceptual at age 6. Separation of 
MSU from MFU was believed due to visual vs. auditory channels. 


Ns шау of ability factors in 

oa д has passed the stage 
exist rating that separate factors 
ematie” h effort now devoted to sys- 
abilitie identification of measurable 
t T Me The present study is an ex- 
With 6 of previous explorations made 
(Ss) -year-old preliterate subjects 
(Loeffler, 1963; McCartin & 


e 
Yers, 1966; Meyers, Dingman, Or- 


Det qi S 
Orpen Kei, & Watts, 1964; Meyers, 
Tho.” Attwell, & Dingman, 1962). 


ato о Studies had demonstrated sepa- 
таър mary abilities in motor, se- 
бортів, and figural domains, and in 
V Matin’ convergent, divergent, and 
Ve processes. The present 
ideas designed to confirm some 
the Slee factors and to extend 
апа gp, Oration into memory processes 
Ymbolie content. 


h 
S structure-of-intellect (SI) lan- 


1 
MT 
8 H 
шине of tea in part by the National In- 
opor. „Мема! Health Grant No. MH 
the Сайа ifferential Abilities of Six-Year- 
trig, Los Pe Appreciation is expressed to 
kin Board 0008 Elementary School Dis- 
Who “Tarte of Trustees, administrators, 
m Cooper, teachers, children, and parents 
ig dtro, Tated with the study; to Paul 
takte; ш, 107 Invaluable chometric as- 
а Е psyc a 
7s edite, О Michi Yomogida for pam- 
by um E asistance, Data were prot- 
Я Univer Western Data Processing 
? sity of California at Los 


Фуд] 
Stuq 
of the 


guage in this report reflects the model 
of human ability given by Guilford 
(1959) and by Guilford and Hoepfner 
(1963). In the SI, "cognitive" refers 
specifically to knowing or recognizing, 
not to all processes. It is distinguished 
from "convergent production," in 
which problem-solving is required, as 
in completing a verbal analogy. “бе- 
mantic” and “symbolic” differ in that 
the former deals in meanings of lan- 
guage systems while the latter deals in 
the symbols apart from their meaning, 
as in anagrams, algebra, or grammar. 


НүротнЕѕЕЅ AND Test BATTERY 


The hypothesized factors were de- 
veloped primarily from Guilford’s 
(1959) theoretical model of the struc- 
ture of intellect and the child-develop- 
ment literature regarding the develop- 
ment and individuation of abilities. 
АП of the hypothesized abilities have 
been well established at the adult level, 
but have not all been established by 
replicated factor studies as existing in 
childhood. The following criteria were 
used in the selection or development of 
instruments to measure the hypothe- 
sized abilities: (a) suitability for indi- 
vidual administration with 6-year-old 
children, (b) face validity of the test 
as a univocal measure of the hypothe- 
sized ability, (c) previous utilization in 


al 
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study, and (d) efficiency in 
25 E Of [^ 20 tests used in ше 
battery, some were used as ренү 
standardized, while others ks 
adapted for use with 6-year-old chil- 
dren, or were developed specifically for 
this study. Six ability domains were 
ized: 
a ag ee memory for figural units 
(MFU). Test 1, Cube Test (Hildreth 
& Pintner, 1937). S taps cubes in same 
sequence as examiner. Test 2, Picture 
Memory (S names objects previously 
seen, now screened). Test 3, Memory 
for Patterns (S points to previously 
seen pattern when mixed with others). 
2. Auditory memory for symbolic 
units (MSU). Test 4, WISC Digit 
Span, forward (Wechsler, 1949). Test 
5, Auditory Letter Span. Test 6, Mem- 
ory for Sentences, taken from Binet 
(Terman & Merrill, 1937) with added 
items. Test 7, WISC Digit Span, back- 
ward (Wechsler, 1949). 
. Convergent semantic production 
(NM). Test 8, WISC Comprehension 
(Wechsler, 1949). Test, 9, WISC Vo- 


cabulary (Wechsler, 1949). Test 10, 
Action-Agent, Convergent (Stutsman, 
1931) with more 


difficult items added 
by McCartin and Meyers (1966). 

4. Divergent production of semantic 
units—ideational fluency (DMU). S 
must produce a variety of acceptable 
responses in each task. Test 11, Ac- 
tion-Agent, Divergent, modified from 
MeMartin and Meyers (1966). Test 
12, Monroe Language Classification 
(Monroe, 1935). Test 13, Utility Test, 
adapted from Guilford and Christen- 
sen (1956). Test 14, Thing Categories, 
adapted from French, Eestrom, and 
Leighton (1963). 

5. Evaluation of figural units 
(EFU). Speedy accuracy is required 
in identifying or matching figural ma- 
terials. Test 15, SRA-PMA, Identical 
Pictures (Thurstone & Thurstone, 
1953). Test 16, SRA-PMA, Identical 
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igur Thurstone & Thurstone, 
io nel 17, ETS Identical Pictures 
t a], 1963). Mm 
EY of symbolic. ш 
(ESU). Like 5 but with symbolic ae 
terials. Test 18, Word eg ^ 
(Harrison & Stroud, 1956). Tes e 
Number Comparison. Test 20, Finding 
0%. 


SUBJECTS AND PROCEDURES И 
Тһе S's were 100 anglo-white ap 
50 boys and 50 girls, in five public BU 
mentary schools from a ap oc rtr 
dential suburb of Long Beach, m ‘with a 
CA's ranged from 71 to 75 mont ja based 
mean of 727 months. The mean 55. The 
on three WISC subtests used was Me dul 
battery required over an hour of es or 
testing, given in two, sometimes 
more sessions. 


RESULTS -— 
Raw score data, none require ter 
transformation, were employed. com- 
correlations yielded 1 gap gro: 
ponent factors by computer pr ns. In 
These were given several rotatio m 
formation presented here is Бай Те вет, 
varimax orthogonal rotation ( ber of 
1959) of seven factors (the motation®! 
hypotheses plus one). Other Т0! "Ingical 
Options, each based on mino! 
differences, were employed um 
mine whether the resulting § mean 
would differ in psychologically 1 to be 
ingful ways. With one excep T 
diseussed the options did m is 
the interpretation. The analys. the 
sented as a whole in Table L pont in 
first six factors (A-F) peu the 
an order most likely to тергев " 
hypotheses. jve 
MU. Factor A is aes " 
identified as support for the 2 of the 
sized factor, as it saturates ine 
variables intended, together W lor N 
forward, which was intende 


hang? 
" 


NT 
lig! 
180. 
Test 1 Knox cube tap- 
ping 


6 
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Test 2 Picture memory .09 
| 32onF 
Test 3 Memory for pat- i ; 
terns -35 
(.39 on F) 


Test 4 Digits forward, 
designed for 


MFU 43 
EL Tests 4, 5, and 6 load well 
actor B which may be identified 


is sought-after MSU. Digits for- 
rd divided its loading. 


Test 4 Digits forward 


Test 5 Auditory letters 
Test 6 Memory for sen- 


tences 


Test 7 Digits backward 


En 
(.43 on 
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A, 


MFU) 


.66 


.64 
AT 


(.33 on E) 


3-NM. Factor C appears to support 
this hypothesis. One variable had a 
second loading. 


TABLE 1 
Variuax ROTATED FACTOR LOADINGS 
A B c D E F e |e 
1. Mp х E. 
Sud Visual memory for figural 
1. Cube test o»| 10| 08| 06} 12 06| —01| 44 
leture memory 09 03 17 12 | 08 32 01 | 26 
Memory for patterns a»| 14| 11| 07 | 25 39 | 09 | 42 
2. Ms 
Msy, Auditory memory for sym- 
€ units 
1. WISC Digi T 05105 | 18| oja 
SC Digits Forward 495 47 03 5 T 
б. Auditory Tarten Прав dol | —08| —03| o3 | —0£| 00) 47 
$: Memory for Sentences д0] osl 27] 12 16 | 08) —08) 59 
: WISC Digits Backward 291 17| 20|] 30/33 | 04] 18) 47 
è. N 
M, Convergent semantic produc- 
on 
-Wisco i „| o| oo) 19} or | 02) 05 55 
2 Comprehensior 12 б E 2 
& Wisc Sauer Н o) 07|] 7r| 27/19 24 EC ү 
4 Action-Agent Convergent 08 | —04 48 16 | 06 п 6 
Ein) р 
мо, Divergent production of se- 
PW A ate 5 25| —17 | 46 
12. à p Hon-Agent Divergent . 01 E 3 Lal m M | Zos | 60 
з ре Language Classification р E 2 бл 06 06 54 
* Thing Categories 001 os| 18| 57°08 —13| 16,43 
yg 
СЕР | 
15 p^ Evaluation of figural units z | —25| —10 68 
16, ЪМА Identical Pictures 19 09 14 Е баз Lo 85 | 70 
1T E ТА Identical Figures 14 09 29 20 81° | —06 | —03 78 
"T TS Identical Pictures 191 uj 02) 29 : 
© ESU 
; Evaluati d А 
1; aluat у {5 "m m 
8. Word ree ec mbolie units "E Б -% 2 a " - 
umbe i =01 | —03 | —04 2 38 
Rindge Oo | 46 | 08 05 00 | 58^ 22 09 | 47 
lors ^ * —Devcimals have been omitted. h? (communality) Based on the 12 extracted fac- 


; 
* dr буер, only 7 factors were rotated. 
Cates loaded factors. 
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Test8 | WISC compre- 
hension .69 
Test 9  WISC vocabu- _ 
lary А а 
Test 10 Action agent, К 
n convergent (.39 on G 
a singlet) 


4-DMU. This appears to be repre- 
sented in Factor D. All four tests 
loaded as predicted to support the hy- 
pothesis. 


Test 11 Action agent, di- 
vergent .59 


Test 12 Monroe lan- 

Euage classifica- 

tion .59 
Test 13 Uses .66 
Test 14 Thing Categor- 

les .57 
5-EFU. Easily identified as evi- 

ЕД 


Test 15 SRA PMA iden- 


tical pictures -70 
Test 16 SRA PMA iden- 

tical figures .64 
Test 17 ETS identical 

Pictures 81 
Test 18 Word compari- 

ison, designed 

for ESU 78 
Test 19 Number com. 

parison, designed 

for ESU .52 
Test 20 Finding 0's, de- 

Signed for ESU .58 


6-ESU. Table 1 shows that no 
factor loads on Tests 18, 19, and 20, 
designed for ESU, except of course for 
the EFU factor. When rotation of al] 
12 factors was performed, the Picture 
changed somewhat, While these three 
tests continued to load on EFU, a good 
Factor “F” appeared, with substantial 
saturations of these variables, suffi- 


cient to suggest the ESU factor might 
be represented. 


strong 


Discussion 


Well-defined factors appear to have 
emerged and to represent evidence for 
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four of the six hypothesized abilities: 
MSU, auditory memory for symbolic 
units; NM, convergent semantic pro- 
duction; DMU, divergent production 
of semantic units; and EFU, ovalus 
tion of figural units (perceptua 
speed). MFU, visual memory for figu 
ral units, was uncertain and aet 
ESU, evaluation of symbolic Lr dien 
hypothesized with perceptual tasks d 
letters, numbers, and words, but xou 
not be separated from EFU except 
the caprice of a rotational option. m 

The demonstration of a eee 
memory factor orthogonal to o ihe 
abilities replicates similar work ат b- 
developmental level in question ien 
bon, 1964; Kelley, 198; Loeffler, 1 gi 
and McCarthy & Kirk, 1964 in a 
but not all of their analyses). € a 
or more short-term memory fac iP 
may exist has some support, such E 
found in retarded of MA 4 by Сап ү 
Meyers, and Dingman (1965) inw АЛ 
the visual faetor was unclear, er e 
Was in the present study, thoug cer- 
separation from the auditory va no 
tain enough. Digits backward di atu- 
join the MSU factor, but its best yard 
ration, with DMU, is no surprise ards 
sidering the nature of the backw 
task. ic 

The NM or convergent sema 
production factor appears to be а -old 
member of the repertory of б-уе® yr 
abilities. This is an aspect of Se er 
ditional general-verbal ability g the 
brated in age scales, and replicate and 
factorial work by Loeffler по), js- 
McCartin and Meyers (1966). Me- 
Sue of factor name might arise. able 
Cartin and Meyers had not Бе пс" 
to make a very sharp factorial ergen 
tion between cognition and conver’ is 
Operations (discussed above). anno 
possible that at this age the two С The 
be separated in practical teeny jire 
Present tasks were judged to of ех" 
more of the child’s formulation 


ee 
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planations than of comprehending ver- 
ү Input, so the “N” or convergent la- 
= "e preferred. The question of how 
е difficulty of “С” or cognition en- 
s in the tasks is not presently an- 
А erable, As to the absence of a third, 
di Droducts, term in the lexicon of the 
Tucture of intellect, (as U for units, 
tio bi classes, ete.), this was inten- 
€ ally avoided since the children’s 
n Ponses to Tests 8, 9, and 10 had to 
nge from single words to sentences. 
Me MU or divergent semantic units 
a je to hold a secure position 
ү е adaptation repertory of this age. 
ee ideational fluency” factor was 
amed with some of the tests used 
У MeCartin and Meyers (1966) and 
Же new ones, and is apparently the 
in factor as that given different 
у ko e (Bereiter, 1961; Loef- 
1964). ; and McCarthy & Kirk, 


8 


А Evaluation of figural units or EFU 
Dor Bood factor, the results here sup- 
Ing the investigations of Kebbon 
(19 "s Loeffler (1963), Meyers et al 
0.169), Meyers et al (1962), Stott and 
(1963), and Thurstone and Thur- 
Speeg» 1999). This is the “perceptual 
Nam factor, under this and other 
Varian, The factor also captured the 
implice of the ESU tests, with the 
me Cation that children this age still 
aga act to visual symbols of lan- 
Study as perceptual tasks. While this 
оду Produced a separation of MSU 
that FW in the memory domain, 
Шер Шегепсе was due in part to the 
di eid in sensory input channel, a 
Contras зу not present in the ESU-EFU 
toy, mary, the present study has 
Of py; ned and extended the knowledge 
of б Mary ability factors in children 
n th Onsidering the present evidence 
ое light, of other reports, the hy- 
tmy оѓ MSU, NM, DMU, and 
арреаг to be confirmed now be- 
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yond reasonable question, with the 
possibility that MFU may be yet de- 
veloped. 
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METHODOLOGICAL NOTE ON FEELING-OF- 
KNOWING EXPERIMENTS: 
J. T. HART 
University of California, Irvine 


Further investigations of the ability of Ss to accurately pre- 
dict memory contents when they are unable to retrieve those con- 


tents, a process termed the memory 
are reported. 2 possible sources of error in the 
were examined—either source could have falsely 
accuracy scores. The replication findings show T 
instruct Ss to guess even when they believe that they 


-monitoring or MEMO process, 
earlier experiments 
inflated the MEMO 
that it is important to 
do not know the 


correct answer. The instruction to guess forces overcautious Ss to 
provide answers that they would otherwise withhold. Answers guessed 
correctly can be eliminated before MEMO accuracy is evaluated, thus 
overestimates of MEMO accuracy are avoided. 


it possible for people to know what 
memory even when they are un- 
mr retrieve the memories? Evi- 
ment provided in two previous experl- 
ie (Нан 1965) suggests that they 
ing © пе accuracy of feeling-of-know- 
ing Xperiences was assessed by apply- 
RIR a recall-judgment-recognition 
(Ss) 2 | paradigm, in which subjects 
est irst took a general information 
try Ad recall and were instructed to 
"a predict recognition failures and 
айу, Sses for items they were unable to 
мач by referring to their feelings of 
Tec about the sought-after items. 
celi aking the recall test and making 
ng-of-knowing judgments, (FK) 
m FK (no)? Ss were given à 
5 us SRM test. Usually Ss were able 
i Sg ‘a accurately which unrecalled 
the they would recognize and which 
М Would not recognize. 
he intervening process yielding FK 
Mem, E judgments is termed the 
Е during (MEMO) process. 
Ава} = author gratefully acknowledges the 
опат H reccived from Albert Hastorf, 
ti ? See H: orowitz and Karl Pribram. 
on art (1965) for a complete explana- 
the terms FK Hit and Miss and 
P ed Miss; they correspond to the 
alse р Ysical terms True Positive and 
ative 16, and True Negative and False 
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An accurate MEMO process could be 
useful to a fallible storage and retrieval 
system as an indicator of when to con- 
tinue and when to stop retrieval ef- 


forts. 

There is some possibility that the ex- 
perimental procedure used in the previ- 
ous experiments overestimated MEMO 
accuracy. The follow-up experiments 
reported in this article attempt to 
eliminate sources of accuracy inflation. 


METHOD 


Experiment I 
In the earlier experiments Ss made feel- 
ing-of-knowing (FOK) judgments for those 
items they could not answer. The instruc- 
tions specifically told them to make FOK 
judgments “Гог all those items you must 
leave blank.” This instruetion assumes that 
items will be left blank only when Ss believe 
they have not remembered the correct an- 
swers. Otherwise it does not make sense to 
talk about the MEMO process as an indi- 
cator of what is in storage at times when 
retrieval is not successful, and to use the 
prediction of recognition suecesses and fail- 
ures to assess MEMO accuracy. Interviews 
with some of the Ss after the completion of 
the experiments revealed that this key as- 
sumption about the meaning of blanks in the 
test of recall was not entirely warranted. 
Some Ss reported leaving the answer 
spaces blank, and making FK judgments, at 
they strongly believed they had 


times when t 
already retrieved the correct answer. This 
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i e because, occasionally, S re- 
rimum than one answer to a question 
(one of which he felt would be correct) but 
was unable to decide among the several al- 
ternatives. This could have produced a false 
inflation of the FK hits and the comparison 
of inflated FK hits to ihe FK misses would 
lead to a false conclusion about MEMO ac- 
curacy. Inflation of an FK hit score would 
occur any time the alternatives on the recog- 
nition test did not exactly overlap the alter- 
natives S had provided for himself. during 
the test of recall. In effect, the experimenter 
would help S to choose between the alter- 
natives. : 

To evaluate this possible Source of error 
Experiment I was conducted using а modi- 
fied set of instructions. The modified instruc- 
tions told Ss to write several answers if they 


one. By using this 
receiving more 
liminated prior 


RESULTS 


The results show that the instruc- 
tion about multiple answers was effec- 
tive. Twice as many errors were re- 
corded in this experiment as in the 
earlier study, and there were fewer FK 
items and fewer blanks. This is the 
pattern of results to be expected if Ss 
followed instructions, since those mul- 
tiple-answer items formerly left blank 
and marked as ЕК% would be differ- 
ently categorized. 

But the altered instructions did not 
influence MEMO accuracy Scores. The 
mean of the FK hits is -70, FK misses 
39. A test of the FK and FK propor- 
tions of correct responses is highly sig- 
nificant (¢ = 5.25, df = 16, p < .01, 
one-tailed). : 

F'or a complete presentation of the 
rationale behind this comparison, see 
Hart (1965). The basic point of the 
FK-FK test is that, if FOK experi- 

ences are accurate indicators of mem- 


J. T. Harr 


ory storage, Ss should recognize more 
Tutus hey feel they know (FK Чеши) 
than items they feel they don't por 
(FK items). The папаи ШЇ ш 
portions correct for FK and FK i E 
isa relative accuracy score and, as aes 7 
is independent of the ease or к 
of the recognition test (unlike ше 
Solute number of FK and FK pre 
tion hits). | 

Wie ы answers are pm 
questions, the answers are as S ШҮ, 
Wrong as they are right. Conseque ea 
withheld multiple answers do not p 


ror in FT 
vide a significant source of error in 
Scores. 


METHOD 
Experiment II 


om- 
Not until Experiment I had pem o 
pleted was yet another possible soh iowa 
error recognized. Postexperiment "Trevenle 
with selected Ss from Experiment ificantlY 
that the Ss appeared to differ sign uess 00 
in their willingness to write down s recall. 
the answer sheets during the test nd woul 
Some Ss were extremely cautious 27 It com- 
enter an answer only when by ais were 
pletely sure of its correctness. Ot. ad better 
willing to write anything that seem 
than a sheer guess. many $8 
Overcautious answering by number 
could lead to a false increase in ше ques 
of FK hits, since Ss would leave OK ratings 
tions unanswered and give high FO t not en 
at times when they were nearly; oe 
tirely, certain of the correct era в 
The simple way to eliminate t xm an 
of error is to require Ss to constru r every 
Swer, even if they must е, only 9? 
question. A FOK rating is ma they have 
those questions that Ss believe rie бї the 
not answered correctly. Since үе? гесаї 
items answered correctly on the ee MEMO 
are included in the evaluation den shoul 
accuracy, the instruction to gu of corre? 
Prevent overcautious withholding a a. Р 
answers and inflation of FK hit dd to the 
Twenty-five questions were These ques 
recall and recognition tests. 
sec elis 


ource 


" were 
A ons 
? Some of these additional questio? 


i {оппа ris 
d $ a 200-item ques thor 38 
vaa by Murdock (1963). ThE йу вир 
indebted to В. В. Murdock for 
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tions were added in case the instruction to 
Suess resulted in a sizeable decrease in the 
absolute number of FK judgments made by 
the Ss. With 100 questions it was hoped that 
enough FK items would remain to yield some 
evaluation of MEMO accuracy. Fourteen 
Students from an introductory general 
Psychology course served as Ss. 


RESULTS 


The FK scores obtained in Experi- 
ment II were the same as those in Ex- 
periment I. With the addition of 25 
questions the number of FK responses 
Should have increased but didn't. 
Apparently, instructions to guess 
eliminated some of the items that had 
formerly received high FK ratings be- 
Cause the Ss were nearly sure, but not 
Certain, of the correct answers. 

F'urther evidence that the FK scores 

ave been somewhat reduced is that 
the mean of the FK hits dropped from 
3 0 in Experiment 1 to .52 in this Ex- 
Periment, (; = 3.6, p < .01); FK 
Means were not changed. 
th lowever, even with the reduction in 

е inflated FK hit score, the compari- 
Son of the proportion of responses cor- 


plying copies of his recall and recognition 
ests, 


rect for FK and FK items still yields a 
significant result in the predicted di- 
rection @ = 2.46, df = 13, p < .025, 
one-tailed). The MEMO process is ac- 
curate, but less accurate than had ap- 
peared in the earlier experiments. 

The results of the two experiments 
may be succinctly summarized. The 
presence of multiple alternatives to a 
few questions does not produce an in- 
flated estimate of the number of true 
FK positives. No special provision 
need be made for handling multiple 
alternatives. Overcautious withhold- 
ing of answers, however, does consti- 
tute a significant source of FK infla- 
tion. Consequently, all experiments 
employing the RJR paradigm should 
instruct Ss to provide answers to every 
recall question, even if they must guess. 
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NDI- 

F SELECTED METHODOLOGICAL co 

TIONS ON DEPENDENT MEASURES TAKEN AFTER 
CLASSROOM EXPERIMENTATION: 


WILLIAM L. GOODWIN? 


University of Wisconsin 


The necessity of controlling the influence of selected extraneous varia- 


bles upon data collected 
gated. The experiment 
4 strata according to 
stratum, classes were randomly 
treatments generated by a 24 
variables: experimental atmos 
test and no notice; teacher а; 
tion; and teacher scoring and 
on subtests of a standardized 
jected to analyses of varianci 
means were found for those 


after classroom experimentation was investi- 
al units were 64 Gth-grade classes, grouped into 
previous arithmetic achievement. Within each 
assigned to 1 of the 16 experimental 
factorial design using 4 independent 
phere and absence of same; notice of 
dministration and outside administra- 
outside scoring. Resulting class means 
arithmetic achievement test, were sub- 
е. Significantly higher (p < .10) class 


classes whose regular teacher adminis- 
tered the test. Several interactions were also significant. 
F 


Controlled experimentation and 
other research activities are becoming 
increasingly prevalent in school set- 
tings. There exist a number of extrane- 
ous variables that may influence the 
precision of the data collected after 
classroom experimentation. Many re- 
searchers use the teacher to administer 
treatments and tests or to assist in 
varying ways; in effect, the teacher is 
cast in the role of a subexperimenter, 
Only limited systematic observation 
and measurement of the effect of sub- 
experimenters on experimental results 
has occurred. 

The researcher must decide whether 
or not to inform teachers that an ex- 
periment is in progress. It is implied by 
Orne (1962) that some teachers, even 
when uninformed, will perceive that 
an experiment is underway and will 


1 The research reported herein was per- 
formed pursuant to a contract (OE 5-10-154, 
Center No. C-03) with the United States 
Office of Education, Department of Health, 
Education, and Welfare (Goodwin, 1965). 
The author also wishes to acknowledge the 
Suggestions and assistance of Herbert J. 
Klausmeier and Julian C. Stanley. 

? Now at Bucknell University. 


speculate about the hypotheses Eg 
investigation. If, on the other han 4 the 
experimenter informs teachers O them 
experiment, he may subtly bias ionS, 
in directions favorable to his vm In 
as Rosenthal suggests (1963, 1965 “in 
addition, teachers and pupils, e nl 
formed of the experiment, may Pe Taw- 
unnaturally well because of the the 
thorne Effect, thereby jeopardizing ^ * 
external validity of the results thlis- 
1962; Mayo, 1945a, 1945b; Roe ick- 
berger, 1941; Roethlisberger & 
son, 1941), ‘all 
Notice of test date is potenti? 
more reactive where pupil progre used 
standardized achievement tests pt 
as an indication of teacher P7. 
(Douglass, 1935; NEL re- 
Thorndike & Hagen, 1901). * 
Searcher, however, working Wl" 
School administrators, can wee 
testing so that any desired per 
notice can be achieved. : Я 
Utilizing teachers to give ап tisa 
tests at the end of an experin enveni 
alluring course of action; it is е pu ils 
ent and inexpensive, and allow”, у, 9 
to react naturally and optim 


le the 


score 
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their regular teacher is the test adminis- 
trator, However, teachers may be bi- 
ased in favor of, and have rapport es- 
tablished with, their pupils to varying 
degrees. In some cases, bias is a natural 
and desirable phenomenon, but an ex- 
periment is not one of these cases, as 
such bias would result in increased un- 
controlled variance. (No less important, 
although not investigated in this study, 
5 the potential bias of outside test ad- 
ministrators.) From the Hopkins and 
fever study (1965), one would expect 
No differences between pupil perform- 
ance on teacher-administered and out- 
Side-administered tests. Phillips and 
eathers (1958) suggest that errors 
dun by teachers in test scoring would 
aise grades as often as lowering them. 
© literature is directly pertinent to 
b e question of the differential effects 
Scoring by teachers and outsiders in 
8 6 case at hand, namely because the 
epe is a routine procedure involv- 
ae multiple-choice responses and a 
Ё Oring key with few judgments by the 
Corer, 
Specifically, the problem to be re- 
p arched has four facets: What is the 
n ect of varying conditions of experi- 
„ental atmosphere, notice of testing, 
Side administrator (teacher or out- 
8] de , and test scorer (teacher or out- 
tien upon student performance as 
ee by test results? The impor- 
€ of these methodological consider- 
a research in education and 
ee a psychology is readily ap 


МЕтнор 
Suy; ects 


gat experimental units were 64 sixth- 
Seleg » assrooms in a large Midwestern city, 
Ane о from among 87 schools which had 
Seq," More classes containing at least 15 
Selo, Semester sixth-graders. One class was 
One g €d from each school having more than 

class by using a table of random 


351 


numbers. It was necessary to include only 
one class per school because of the possible 
reactive or contaminating nature of the ex- 
perimental treatments. 

To reduce random variability in the de- 
sign, a stratified random sampling procedure 
was used. The 87 classes were ranked on the 
basis of previous arithmetic achievement 
(class means on the Jowa Test of Basic Skills 
given earlier in the school year), and subse- 
quently grouped by fourths, from highest to 
lowest. Within each of the four resulting 
strata, classes were randomly assigned to 
one of 16 experimental treatments. The re- 
sulting 64 classes were taught by 38 male 
and 26 female teachers; in all, 1657 students’ 
performances were analyzed. (Only arith- 
metic scores of pupils enrolled in the second 
semester of sixth grade were used in deter- 
mining strata and in reaching conclusions, 
although all the pupils in each class were 
tested in order to achieve realism and credi- 
bility in all treatments.) 


Procedure 

The 16 experimental treatments were the 
combinations generated by a 2* factorial de- 
sign using the following independent varia- 
bles in connection with a recent arithmetic 
achievement test as a response measure: 

1, Experimental atmosphere (4-) and ab- 

sence of the same (—); 
2. Notice of test date (--) and no notice 


05 

3. Testing by regular teacher (4-) and 

testing by outsider (—); and 

4. Scoring by regular teacher (+) and 

scoring by outsider (—). 

"reatment under Variable 1 was effected 
by letter from the office of the school sys- 
tem’s director of research. Experimental 
units under the experimental-atmosphere 
condition were informed by mail 14 days be- 
fore the testing date that they were in an 
experiment. Units not under the experimen- 
tal-atmosphere condition were told when 
notified of the test date that they had been 
randomly selected to collect normative data 
for a new standardized test. . 

Notice of test date was effected by mail 14 
days prior to test date. Under the no notice 
condition, teachers were not informed of the 
testing until the Friday preceding the Mon- 
day test date. : 

The test-administrator variable was in- 
troduced by sending copies of the test and 
manual to the teacher administrators when- 
ever their class received notice of the test- 
ing. Outside administrators (graduate and 
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d undergraduate students in meas- 
Roe Coarse] wets given the test packets 
by their college instructors after being ran- 
domly assigned to classes. All administra- 
tors (teacher and outside) also received 
detailed written instructions on how to 
prepare for administering the test; neither 
group was contacted personally by the ex- 
perimenter or his assistants to discuss proper 
test procedures. The instructions received 
caused the outside administrators to believe 
either that an experiment was or was not in 
progress, according to the experimental 
treatment the particular class was receiving. 

'The test-scoring variable was imple- 
mented by leaving the exams with half the 
teachers for scoring within four days. The 
other tests were collected and scored by four 
outsiders whose orientation in regards to ex- 
perimental atmosphere was identical to that 
of the teachers whose exams were scored 
(two of the scorers believed 
resulted from an ex 


dependent variables were in- 
W Scores in arithmetic compu- 
concepts, applications, and a total 


vestigated: ra 
tations, 


ns, Contains 39 word 
pupil must interpret 
5 figure sales tax, ete. 
ted in 1964 and was 


hers in the selected 
school system. 


RESULTS AND DiscusstoN 


Resulting class means on each of the 
four dependent variables were sub- 
jected to a 4 X 2* analysis of variance, 
The error term initially was composed 
of all four-factor interactions and the 
single five-factor interaction (df = 16); 
these higher order interactions were as- 
sumed to be estimates of o2, It was de- 
cided a priori to use this error term to 
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test the remaining three-factor interac- 
tions that involved the leveling yari 
ble before pooling further. The sums a 
squares and degrees of freedom of atar 
tistically nonsignificant interaction 
were to be included in the final € 
term; on the other hand, a significan 
interaetion, or even one approaching 
Significance, certainly could not be v 
Sumed to be zero or small, could not 
considered an estimate of о?, and was 
not to be pooled (Green & = 
1960). Three-factor interactions t E 
included the leveling variable " 
tested using the initial error term, ie 
any interaction whose F ratio excee bü- 
twice the 50% point of the F distri н 
tion with the corresponding depre. 
freedom was not pooled. The nonsign i 
cant interactions were assumed to E 
estimates of o? and were pooled to E E: 
the final error term. Only one e 
factor interaction, Test Adminis i 
tor X Scorer X Previous Adern 
fell into the no-pool category on t the 
of the four dependent variables; po А 
fourth subtest, Computations, "t 65. 
ratio reached the critical value of ‘ous 

Of the main effects, only prev hly 
arithmetic achievement tested au n 
signifieant for each of the берсе test 
variables (p < .001). Notice 0! the 
reached significance (p < .05) "da sses 
case of the Concepts subtest; © ар 
receiving notice of the test (+) 8 ving 
aged 15.51 while those classes rece! га 
no notice (—) averaged 14.64 ane? 
Score units (a difference of арр rent 
mately 2 months in grade place ts 
units). None of the other main 2 are 
Were significant; the relevant mea 
presented in Table 1. —— ithoug? 

Interestingly, test notice, 2| ubtests 
Significant for the Concepts d of ef 
apparently had a complete lac tation? 
fect in the case of the Сор нет 
Subtest. Examination of ће ber ana 
involved suggests a plausible bes u- 
tion. The computation items 2 
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TABLE 1 
Raw SCORES ON SUBTESTS OF STANFORD ARITHMETIC ACHIEVEMENT TEST 
BY INDEPENDENT VARIABLE 
Subtest Average 
Independent variable Computation Concepts Applications 
= x 5 
х s x s x E 
Experimental at- + | 18.12 | 3.95 | 15.30 | 3.59 | 20.58 | 4.46 | 18.00 3.85 
nosphere = | 17.88 | 4.21 | 14.85 | 3.42 | 20.15 | 4.44 | 17.63 3.84 
TET 
Notice of testing + | 17.95 | 4.45 | 15.51 | 3.72 | 20.61 | 4.73 | 18.02 lf 
— | 18.05 | 3.67 | 14.64 | 3.24 | 20.12 | 4.15 | 17.60 3.48 
Test administrator + | 18.54 | 4.08 | 15.44 | 3.43 | 20.70 | 4.45 | 18.23 3.84 
— 17.46 .90 | 14.72 | 3.56 | 20.03 | 4.43 17.40 3.81 
Test Scorer + | 18.23 | 3.79 | 15.11 | 2.97 | 20.58 | 3.42 | 17.97 3.28 
— | 17.77 | 4.24 | 15.04 | 3.99 | 20.15 | 5.27 | 17.65 4.34 


um drill-type problems. Normally, 
"dents have attained their ability to 
© this type of problem over several 
уша of frequent practice. Ап inor- 
i e emphasis practicing similar 
à ms would be necessary to bring 
ded any appreciable gain in the stu- 
UM performance on the task. How- 
er, the problems in the Concepts sub- 
ч ате more of the “аһа” variety, 
boa is, extremely puzzling when first 
E countered but relatively routine 
in €r a short discussion of the concepts 
Volved. It is possible that teachers 
lys received notice of the testing read 
moti the test, items. With no conscious 
ue lvation to aid their pupils on the 
ione they may have been attracted by 
Probi of the concept (and application) 
A ems and subsequently may have 
cussed that type of problem with 
onl classes. Pupils in no-notice classes 
uni d not have had a similar oppor- 
ЧУ to learn of the concepts involved 
€ test items. 

favo, € entries in Table 1 consistently 
bean the + experimental atmosphere 
t, ment, but the associated F ratios 
© values far from significance. А 


riti s є 
іса] difference between this study 


and those mentioned above which con- 
tend that experimental atmosphere is a 
potent variable is that in the current 
investigation the full burden of convey- 
ing or administering the experimental 
atmosphere condition fell upon a short, 
and in some respects innocuous, para- 
graph of instructions. This simulated 
experimental atmosphere is noticeably 
different from the daily interaction be- 
tween experimenter and subject that 
took place in the Hawthorne Western 
Electric plant. The obvious possibility 
should not be overlooked that experi- 
mental atmosphere may, in actuality, 
have little effect on teachers in many 
situations. 

When teachers administered the 
test, resultant class means were higher 
by nearly a raw score point, represent- 
ing 2 months’ achievement, than were 
means of classes in which outsiders 
gave the test (Table 1). The average 
F ratio for this main effect is signifi- 
cant at approximately .10, considera- 
bly different from the .05 level used 
heretofore. The consistency of the 
effect across all subtests, however. 
lends support to its claim as a true, 
rather than a chance difference. If 
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this is a true difference, its cause might 
due to several factors or some com- 
ое of them. Rice (1897), Lowell 
(1919), and Traxler (1951) suggested 
that indirect hints given by the teacher 
during the test might aid pupils to 
obtain higher scores. Tacit under- 
standings and rapport between teacher 
and pupil might be equally important 
considerations. Anxiety may reach a 
higher level when outsiders administer 
the test, especially for younger pupils; 
in some cases, this anxiety may impair 
the pupil's performance. In any event, 
the investigator should consider al. 
ternatives that permit testing classes 
under nearly identical conditions; the 
use of television (Hopkins & Lefever, 
1965) or prerecorded instructions offer 
promise of reducing the uncontrolled 


variance inherent in using several test 
administrators, 


From Table 1, it w 


€ errors made 
likely to raise 
ts, supporting 
Phillips and 


ide Variation 
In error rates of the scorers, both teach- 


ers and outsiders, suggests that it 
would be desirable to use a few highly 
competent scorers, thereby reducing 
error variance resulting from inac- 
curate scoring. 

Two first-order interactions reached 
significance, each on three of the four 
dependent variables. The means gen- 
erated by the interaction of Experi- 
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mental Atmosphere X Test Notice 
are presented in Table 2.3 The — 
of this significance was a Miri 
resulting from a high mean raw scor 
for the ++ treatment combination, 
low means for the --— and evs 
combinations, and an intermedia : 
mean raw score for the — — combme 
tion. This interaction not only De 
lights the differential eed mie 
an aura or atmosphere of experimen m 
tion when combined with notice иг 
approaching test, but also raises = 
question of why the — — cell class 
erformed so well. 
í The other significant eon 
interaction was Test Scorer X con 
ous Arithmetic Achievement, the p 
fying variable; relevant means ted 
presented in Table 3. As can be d 
in the table and Figure 1, a py 
crossover occurs. The source of or 
significance is the higher means ses 
Outside scorers on Stratum 1 p 
lests (high previous achievers), i. 
the reversal of this situation for eans 
3 and 4 (ie, higher raw-score ™ 
for teacher-scorers). It suggests 
the teachers in the upper enim - 
classes put more emphasis on mis 
ing and/or preparing their e new 
the test when they (the teachers) iders 
that it would be scored by ont em- 
than when they were to score oo ip 
selves. On the other hand, teac their 
the lower strata did not increase sorts 
preparation and/or motivation € 


atum 


tables, 
2 In this table and all subsequent reat 


ын хое; indicate t 
algebraic Signs will be used to imi h ihe 
ment interactions in accordance 


ro 

1а, ; in the P 
treatment definitions given il refer (0 
cedures section. The first sign W seco” 


the first term of the interaction, tbe glor "n 
Sign to the second term, etc. Foro jg liste 
Table 2 the significant ere Notice! 
as Experimental Atmosphere : erimenti 
thus, + + would refer to + ings а ga 
atmosphere and + notice of tes mosphe 
would refer to — experimental & 


and + notice of testing, etc. 
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" TABLE 2 
AW Scores on SUBTESTS OF STANFORD ARITHMETIC ACHIEVEMENT TEST: EXPERIMENTAL 
ATMOSPHERE X Test Notice TREATMENT COMBINATION 


i Subtest Average 
au Experimental 
mosphere X Test Ы 
Otice treatment Computation Concepts Applications 
combination g в 
x s X s X 5 
"Eo 18.85 4.22 16.10 3.56 21.41 4.53 18.79 3.91 
+ — 17.38 3.64 14.49 3.55 19.75 4.37 17.21 3.73 
mae 17.05 4.63 14.92 3.89 19.82 1.92 17.26 1.42 
= 3 18.71 3.70 14.79 3.01 20.49 4.02 17.99 3.28 
F ratio 5.55* 3.39 6.97* AG” 
*p < 05. 


when they knew the test would be 
тоа by outsiders. The average 
ifferences on the three subtests 
Avored the teacher-scorers by .69, 
14, and 1.36 raw-score points (or 
Abproximately 1, 2, and 3 months’ 
School achievement) for Strata 2, 3, 
and 4, respectively. Only the fact that 
NC opposite situation was true in 


Stratum 1 (where the mean of the 
tests scored by outsiders exceeded 
that of the tests scored by teachers 
by 1.92 raw-score points, over 4 
months’ achievement) kept the main 
effect of test scorer from reaching ; 
significance. 

Statistical significance was again 
reached when this two-factor inter- 


TABLE 3 


Raw Scorrs on SUBTESTS OF STANFORD ARITHMETIC ACHIEVEMENT Test: TEST SCORER 
X Previous ARITHMETIC ACHIEVEMENT (SrRATUM) CoMBINATION 


Subtest Average 
Previ А à = 
c] ievment tenets Computation Concepts Applications i m 
TS os TS os TS os 
l (Hi X 20.79 | 22.19 | 17.66 | 19.74 | 23.76 26.05 20.74 22.06 
ы 5 2.6! 9.69 | 1.64 | 2.26, 1.62 2.13 1.89 2.04 
Х 7 4 5 .99 19.30 
2 X | 20.40 | 18.69 | 17.15 16.53 | 22.44 | 22.09 19.9 
8 2.13 | 2.75 | 1.41 | 1.97 1.42 | 1.69 1.41 1.66 
X 3.97 P А 17.51 10.37 
3 X |18.01 | 16.97 | 14.42 13.59 | 20.10 | 18.53 
8 3.61 | 2.91 | 1.43 | 2.00 2.18 | 2.66 2.27 2.08 
4 .73 | 13.22 | 11.20 | 10.31 16.02 | 13.33 13.65 12.29 
ini Г p^ 2.96 | 1.44 | 1.79 1.67 | 2.09 1.45 2.32 
F ratio 1.01 3.26* 6.09** 3.11* 


ч z ide S 
Note.— Abbreviations are: TS, Teacher Scorer; OS, Outside Scorer. 


x. P < 05. 
P < 01, 
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= Outside Scored 


Raw Scores 


Statum Averages (P 


Fro. 1. Graph of Test Scorer x Previous Arithmetic Achievem 


— + —'— = Teacher Scored 


20.0 
p Ру, and P) 


ent interaction 


on applications subtest. 


contributed 
Inspecting 
urth strata 
for the four possible combinations of 
corer varia. 
to 4.02 are 
found (which, converted, range from 


2 months to over 7 months’ achieve- 
ment in grade-equivalent units) favor- 
ing the + + treatment combination 
over the + — cell. The reverse situa- 
tion is true for Stratum 1, with the 
+ — cell means exceeding the + + 
Taw score means by 2.49 to 4.15 units 
(approximately 3 months to over 1 


he 
year in grade-equivalent units) ones 
three subtests. Differences bet abi- 
the — + and — — treatment im all 
nations are appreciably smaller ction 
strata. This second-order inter? 
illustrates that the significant Level 
Scorer X Previous Achievement / ex- 
interaction was generated almo a cher 
clusively in classes in which the te indi- 
administered the test. It further hich 
cates that Stratum 1 classes n repat” 
teachers administered the test P siders 
atory to its being scored by E scores 
Produced pupil achievemen 


А sam 
notably above classes in b admin 
Stratum whose teachers bot elves 


istered and scored the test {деш үре 
At the other extreme were the ie be 
tions in Strata 3 and 4. 4 um 
Suggested that teachers in 
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TABLE 4 


Raw SCORES ON SUBTESTS OF STANFORD ARITHMETIC ACHIEVEMENT TEsT: TEST 
ADMINISTRATOR X TEST SCORER X PREVIOUS ARITHMETIC ACHIEVEMENT 


(STRATUM) COMBINATION 
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r_a 


Test Administrator X Test Scorer treatment combination 
Previous 
Subtest | achievement | ++ Ф -+ жа бәй? 
(stratum) 
х s = 5 X s х s 
Computation 1 (High) | 20.19| .74| 22.68) 3.85) 21.38] 3.93) 21.69) 1.18 1.57 
2 20.16} 2.60) 19.76) 3.18) 20.64} 1.91 17.61 2.09) 
3 20.25| 3.38| 16.92) 3.97| 15.77| 2.33| 17.03) 1.99 
4 (Low) 14.93) 2.09] 13.43| 4.09) 12.53| 1.18| 13.01) 1.91 
Concepts 1 (High) | 17.31 1.01 20.98| 2.39| 18.01 2.39| 18.51| 1.44) — 3.08* 
2 16.85| 1.47| 16.90| 2.12) 17.44| 1.46) 16.16) .95 
3 15.46} .60| 12.93| .92) 13.38| 1.24 14.26) 2.71 
4 (Low) 12.04| .60| 11.01 2.31 10.36) 1.61 9.61| .94 
Applications 1 (High) | 23.51 1.13) 27.66] 1.58) 24.01 2.22) 24.44 1.05 4.14* 
2 22.10| 1.59| 22.39] 1.62| 22.72| 1.41 22.98) 1.96 
3 21.61| .83| 17.59] 1.15| 18.59| 2.09) 19.47| 3.57 
4 (Low) 17.13| 1.62] 13.58| 1.48| 14.92] .83| 13.08) 1.69) 
Average 1 (High) | 20.34| .85| 23.78) 2.46) 21.13 2.08| 21.55) .62 3.43* 
2 19.72] 1.84] 19.69] 2.29] 20.27] .98| 18.92) .90 
3 19.11] 1.41) 15.81] 1.99] 15.92} 1.80} 16.92) 2.34 
4 (Low) 14.70] 1.38) 12.67} 3.20, 12.61} .32) 11,90) 1.39 


в b omputed using initial error term (df — 16) for Computation Subtest; on all other 
Ubtests, final pooled error term (df = 31) was used, 


P < Q5 


pu administered the test may have 
st €n procedural and motivational 
a PS before and/or during their 
conga tration of the test to insure 
tinued high achievement by their 
‘Upils even when the tests were scored 
Y outsiders. On the other hand, teach- 
hot hy the lower achieving strata may 
Or, if ave engaged in similar behaviors, 
ha, I! they did, these teacher behaviors 
effec. minimal or even deleterious 
Pupil 9n the achievement scores of the 
"5 Concerned. 
the 25 important to keep in mind that 
frg, "Sulis reported above emerged 
of v, Pelatively simple manipulations 
ај, "bles and were limited to stand- 
ы etj; #©Меуетеп& tests in arith- 
Neng, „vare of the differences that did 


в, ШҮ? doing ү, 


the schools should carefully stand- 
ardize their procedures and techniques 
so as to control the effect of such 
methodological conditions. 
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MODIFIABILITY OF AN IMPULSIVE TEMPO: 
JEROME KAGAN, LESLIE PEARSON, лхр LOIS WELCH 
Harvard University 


Impulsive Ist-grade Ss were trained to be reflective under 2 different 
tutoring conditions. In 1 group of Ss (N = 20) the trainer first per- 
suaded the child that he and the S shared some interests and at- 
tributes, while the 2nd group (N = 20) was not treated in any way that 
would lead the child to believe he and the trainer shared characteris- 
tics. Both groups showed longer response latencies after training. The 
condition of perceived high similarity to trainer facilitated the train- 
ing for some girls, but not for boys. 


An extended series of investigations 
on a conceptual tempo dimension 
called reflection-impulsivity has dem- 
Onstrated impressive stability over 
both time and tasks, and meaningful 
Predictions to a variety of problem- 
Solving tasks (Kagan, 1965a, 1965b; 
Sagan, Rosman, Day, Albert, & 
Phillips, 1964). Impulsive children, in 
Contrast to reflective children of similar 
age and verbal skills, make more errors 
m reading prose when in the primary 
rades, are more likely to offer incorrect 
Solutions on inductive reasoning prob- 
ems and visual discrimination tasks, 
and make more errors of commission 
On serial recall tasks. Impulsive chil- 
dren do not seem to care about making 
Mistakes for they offer answers quickly 
and without sufficient consideration 
9r the probable accuracy of their 
Solution. This disposition is often a 

andicap in the typical school situation, 
38 most teachers do not have a high 
Olerance for incorrect replies, and the 
Peer group is prone to jeer at the child 
Hie impulsively blurts out obviously 

Correct answers. 

Although existing evidence suggests 

at an impulsive attitude begins its 
8towth during the preschool years and 
(28У be a deeply entrenched habit 

Kagan, Rosman, Day, Albert, & 
Кары ес здө 
eL. research was supported in part by 
arch grant MH-8792 from the National 
JSlitute of Mental Health, United States 
Ublic Health Service. 


Philips, 1964), it is worthwhile, 
nevertheless, to inquire into the modifi- 
ability of an impulsive tendency. This 
report summarizes one attempt to 
train impulsive children to be more 
reflective. 

There are at least four major motives 
that mediate behavior change: (a) the 
desire for an external reinforcement, 
such as praise, love, money, toys, 
candy, etc., (b) the desire to avoid an 
unpleasant experience, (c) the desire 
to be correct and/or to be competent at 
a task, and (d) the desire to maximize 
similarity to a model. 

The most popular incentives used 
by teachers, parents, employers, and 
psychologists to effect a change in 
response hierarchies capitalize on the 
first two motives, the child's desire for 
a positive social response and his desire 
to avoid a punishment. Some students 
of the tutorial process insist that the 
mere attainment of a correct answer is 
sufficient reward, and they remove all 
human agents from the learning situa- 
tion. 

The incentive least likely to be 
exploited is subject to controversy; 
namely, the child's desire to maximize 
similarity to a desirable adult. It is 
acknowledged that children spontane- 
ously imitate and adopt selected 
responses displayed by particular 
adults. There is disagreement, how- 
ever, as to the reasons for this imita- 
tion. It has been argued (Kagan, 1958) 
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that the child wishes to maximize 
similarity to a desirable adult in order 
to share vicariously in his resources. 
This process eventually leads to an 
identifiation with the model. The 
motive to maximize similarity to a 
model appears to be of major impor- 
tance in the establishment of unusually 
strong motivations for a career, es- 
pecially intellectual careers. It was 
decided, therefore, to compare the 
differential effectiveness of training in 
reflection under two conditions, a 
normally nurturant condition between 
child and tutor, and one in which the 
child was persuaded to believe that he 
shared some attributes with the tutor 
and by becoming reflective he could 
increase the pool of shared character- 
istics. It was assumed that the initial 
belief in similarity to the tutor would 
act as an incentive to Motivate the 
child to add to the number of shared 
similarities. If a child believed that the 
tutor valued reflection he should be 
more highly motivated to adopt this 
attitude than the child who perceived 
no initial similarities to the trainer. 

This investigation asked two ques- 
tions: Can an impulsive attitude be 
modified through direct training in 
reflection? Is 


there any training 
advantage when the child initially 


perceives some basis of similarity to 
the trainer? 


METHOD 


Procedure 


Subjects. A group of 155 first-grade chil- 
dren was administered the Matching Famil- 
iar Figures (MFF) and two verbal scales 
from the Wechsler Intelligence Scale for 
Children, along with inductive reasoning 
tests, in the fall of the first grade. The MFF 
is a visual discrimination task in which pic- 
tures of familiar objects are presented along 
with six similar variants. The child must 
select the one variant that is identical with 
the standard. The child's response latency 
to the first solution hypothesis and errors 
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А i 1 
are the major variables studied. Figuro 
illustrates a sample item from the Oe aie 

The subjects (Ss) were classified аз a per- 
Sive or reflective on the basis of bad » 
formance on the 12 MFF test items. à 
sive children were above the median eee 
errors and below the median ор ie their 
response latency (for all 12 items) A s d 
Sex (that is, they responded quic аге 
made many errors). Reflective © above 
were below the median on errors ап impu 
the median on response latency. The roups: 
sive Ss were assigned to one of three E eive 
trained under conditions of per 1-14); 
similarity to the trainer (Group ity t0 
trained under low perceived similari... 
the trainer (Group I-non Id); and P hildren 
ing (Group I-C). The assignment of € ro 
to the three groups was such that the Pros 
of verbal ability was close to identica 10 girls 
the groups. There were 10 boys and ulsive 
in each of the three groups of Thilde? 
children. In addition, 20 reflective ct à 
(10 boys and 10 girls) were selected 5 ese S8 
total pool of 75 reflective children. а оке 
were not given any training and were 
as Group I-C. 


Training 


jning 
Each of the 40 children in the two кй, а8 
groups was seen by an adult the same with 
the child? and each trainer worke n 18. 
children in both Groups I-Id and I-no each 
Each child was seen for three sessions, The 
Session lasting about 40-50 minutes. 
sessions were separated by 2 or 3 days. ring 
Training session 1: Group I-Id. | enter 
the first part of the session the experi and 
(E) began to persuade the child that J was 
S shared interests and attributes. pp о! 
accomplished by having E ask 5 & һа! 
Standard questions. After the chil wa: 
answered E would answer in & similar veo? 
and comment on the similarity рас i 
adult and child. A sample of dialog" 
Session 1 follows. А do you 
How many brothers and sisters wers 
have? [After the child answers, Ё d an 
in а way consonant with the chil 
adds: Ё 
ad we are the same, we both bari e 
(brother, sister). Let/s see if we A yor 
same in other ways. What are your А re- 
ite foods to eat? [After S answers, e of 
sponds in a consonant manner for on 
the items and adds:] 


is 
2 The trainers were Leslie Pearson, FA 
Welch, David Cohen, and Robert Stewart. 
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Fic. 1. Illustrative item fr 


youd, ете the same on foods, too. What’s 
оч game? [After S's answer, 
marks o in a consonant manner and re- 
loa the similarity and then asks:] 
lE n at other games do you like to play? 
бе one consonance after these an- 
pe 
s see how else we might be the same. 
Answers your favorite animal? [After S 
rings ; E answers consonantly, then 
ree out a leaflet of animal pictures with 
Le, рісішгов to a page]. 
I LA pan at some animal pictures that 
then ere and we will pick our favorites. 
otter, в shows S three pictures, of а sea 
Picks. pay vase and giraffe. After the child 
one aud ‘avorite animal E picks the same 
Followi remarks on the similarity]. 
азо S the first phase of similarity 
the aa took about 20-30 minutes, 
tinted the training in reflection was ini- 
reflective e E first commented that he was 
is etus n valued reflection on tasks. 
ev comments were: 
things t so much the same on 80 many 
ing REM I'm going to tell you some- 
Same on th [узген Maybe we can be the 
not to aile » too. I think it’s important 
take m mistakes and to be right. So 
y time and always check my 


om the MFF test administer! 


ed after training. 


the things we're going to 
me other kinds of games a 
little bit like what you did just before 
vacation. The important thing in these 
games is to try not to make a mistake. I 


want you to get them right every time on 


your first try and one way to get them 

right is to think about your answer. [The 

E then demonstrated how he behaved 

reflectively on tasks with response un- 

certainty. The training then began]. 

The training tasks were the same in all 
three sessions and included parallel items. 
The three training tasks were а haptic- 
visual matching task, 2 design-matching 
task, and an inductive reasoning test. 

In the design-matching task the child was 
shown a geometric design and an array of 
similar variants and was asked to select the 
variant that was exactly like the standard. 
This task had the same requirements as 
МЕЕ except that all of the stimuli were 
geometric designs and none was a familiar 
figure. The second task was an inductive 
reasoning task in which the child was told 
three characteristics of an object and had to 
guess what the object was (e.g., What i 
cold, white, and round?). The third task eae 
a haptic-visual matching task in which vhs 
child haptically explored a letter of the 


answers. One of 
do together is во: 
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alphabet that was made from diserete tacks 
placed in a board. The child had to select 
from a visual display of letters the letter 
that was represented by the tack display 
he had explored haptically. The visual array 
was exposed while the S was exploring the 
tacks. During the training session S was 
given six items from each of these three 
tasks. 

The training procedure for delay was 
direct. The E told the child that he was not 
allowed to offer an answer for a fixed period 
of time (10 or 15 seconds). The Æ said that 
he would look at his watch and tell the child 
when he was allowed to answer. The E 
instructed the child to study the stimuli in 
the task and to think about his answer dur- 
ing the enforced period of delay. The 15- 
second delay was imposed for all items on 
the visual matching and haptic-visual tasks; 
а 10-second delay was imposed for the induc- 
tive reasoning items. The child was only 
allowed one response per item and was not 
told if that response was correct or incor- 
rect. All of the children delayed their re- 
Sponses most of the time. On the few occa- 
sions when a child did not delay, the E 
accepted the response and proceeded to 
the next item. 

Training session 1: Group I-non Id. The 
E administered the same pretraining inter- 
view as described for Group I-Id, with one 
difference. The Æ never indicated that he 
Shared interests or attributes with the child. 
He merely commented positively on the 
Child's choice. The specific training in 
reflection proceeded exactly as was de- 
scribed for Group I-Id. 

Training session 2. The format for train- 
ing sessions 2 and 3 was similar to the one 
described for Session 1. The Ss in groups 
I-Id and Lnon Id were asked the same 
questions in the same order. However, the 
Ss in Group I-Id were reminded how similar 
E and S were; the children in Group I-non 
Id were not given this information. The E 
Spent the opening part of the interview 
asking new questions about S's interests 
and, according to a fixed Schedule, either 
commented that S and E were similar in 
their answers to selected items (for Group 
I-Id) or merely noted the child's answer 
without indieating consensual agreement. 
The direct training for reflection followed 
&nd the tasks included six additional items 
from the visual matching, haptic-visual 
matching, and inductive reasoning teare 
As in Session 1, E insisted that the chi 


delay a fixed period of time before answer- 
ing. 
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Training session 3. The final sensi ara, 
voted a shorter time to protraining Di б 
The training tasks included а fina k: 
six items from the three training p em- 

In sum, the training proce answers, 
phasized inhibition of impulsive efficient 
but placed no emphasis on more ауе 
visual scanning techniques or оган only to 
reasoning. The children were укышы given 
delay their answers; they were 
better ways of solving the problem. 


Evaluation of Training 


ation 

* Each S was seen for a final өй Н 

session between 6 and 8 weeks ог wo 
training interview. The examiners complete 
new adult females! who were over, the 
Strangers to the children. More the chil- 
examiners did not know whether i nor i9 
dren were impulsive or refed Each 
which training group they kelangn of MFF 
S was given an entirely new versio jnduc- 


а В ап А 
under standard instructions and he indu? 


icture? 
pictur 


additional pictures and was ogy н 
the one picture from the four а ned next, 
that illustrated best “what ларва from tbi’ 
Figure 2 illustrates a sample ite 

test. 


Data Analysis 


f 

i rms 0 

Each S had been given different PA s 

the MFF and Picture Completion ET 
Tests six months earlier, in tho po y! 

first grade. The major variables dnd error 

were: (a) the raw response пове C nang? 

scores following training, and per and 

scores for response time and errors Prouded 

posttraining. The MFF variables 


res 
absolute and percent-change sot chang? 


response time and errors (percen s ай 
Score was obtained by dividing Ait test 
ference between the values on th the 


session and the second test Ee picture 
first test session score). ХЪБ variable? 
Completion Reasoning Test the e after 
were mean number of errors ma‘ orret 
training, response time to the inc d the 
choices, response time to all items, а 


percent-change score for response tim 
the incorrect items, 


d 
Janet Levine and Marion Clevela® 


A on 
were the examiners for this evaluat! 
phase. 
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Fic. 2, Illustrative item fro 


R 
ESULTS AND DISCUSSION 


Ta 
n 1 contains the mean scores for 
ables j^ groups on the major vari- 
trainin he only important effect of 
Mes A was to lengthen the response 
trainin, МЕРЕ. The children in each 
t es T group showed longer response 
contro] o MFF than the impulsive 
Sàc " (p « .05, two-tailed test for 
Broup оир), Moreover, the trained 
A heat larger percent increases 
Donse "i time, when the current 
( esn e measure was compared 
P < 05,¢ t obtained 6 months earlier 
or Group с. for Group 1-14; р < 10 
Dat the ot Id). It is of interest 
lve child ponse times for the reflec- 
ren who were not seen 


Tes 


m the Picture Completion Re 


$ 


asoning Test. 


between fall and spring were not signifi- 
cantly longer than those of the impul- 
sive children who were trained in delay. 
The training produced response la- 
tencies that matched those of normal 
reflectives. 


The variability of th: 
time scores was much greater after 


training than it was in the fall, but 
this increase WAS due, in part, to the 
ase in mean response times 


e МЕЕ response 


was 63 for Group I-Id, 64 
for Group I Id, and 53 for the 
control group (Group C). 
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TABLE 1 
Mean Scores on MFF AnD PICTURE COMPLETION REASONING TESTS 
Group I-Id Group I-non Id | Group I-C Group-Reflective 
Variabl irl 
bed Boy | Girl | Boy | Girl | Boy | Gil | Boy te 
МЕЕ . 
e time 8 
"pens 15.2) 327 | 13.3| 21.6| 7.2] 14.0 | 13.8 Lan 
SD 78| 2.2| 88] 13.8] 34| 80| 7.3 . 
Errors Я .8 
И 28.0) 16.6} 25.1| 183|27.6| 23.9 | 22.6 ue 
SD TO) 47| 741 55 7.6| 881 $54 65.1 
% increase response time | 133.4 460.0 | 127 1 | 235.6 | 36.3 | 133 | 105 102.1 
% increase errors 5.7 —25.8| 4.0 |-19.3 | 22.0 | —4.5 | 165.7 à 
Picture completion 
Errors 4.0 
M 801 48| 74! 55| g4| 53| 5.0 Rd 
SD å OOO 28| 28) 35| 29| 25| z8| 2.6] 9 
Response time to Incorrect 16.3 
M 11.7 | 17.9 | 15.1] 18.9 | 12.2 | 16.5 |. 15.8 zh 
SD — 46| 85! 59| 66| 48| 50| 57 
Response time, all items 15.5 
M 10.9 | 14.9] 13.5] 16.0] 118| 150| 12.8 3.8 
SD 34) 3.61 3.7| aol 46| 401 47 
% .nerease in response 2.8 
time to incorrect 37.4 | 70.7 | 70.3 | 10.2] ол | 86.2] 27.3 | -2 


the absolute and Effect of Training on Inductive Reas- 
of response times ing - 
pring, there was no є any trainin 
n variability amon The Ss were not given any 


3 ісёше 

the t i ied specific to the items in the Picto 

hree groups ee Completion Reasoning Test, eti 
not much affected by the training. The Was hoped that the hie fail ‚ Ав 
results were in the anticipated direc- fect performance on ea rodu 
tion, for both training groups made With МЕЕ, the training P gorre 
fewer errors than the nontrained con-  Jhereased тезропав, fines to im no 
trols, but the differences did not reach Choices, but the . с. ors were 
acceptable levels of significance, Per- statistically significant. к 
cent-change in errors revealed that the 20 affected by the training. 
girls showed an average decrease in A ЖЕ ЭЕ, Т. 
errors, whereas the boys’ average Effect of Perceived Similarity 
change score was low, but positive, 2 ramer 
In sum, training had a marked effect 
on response delay, but a minimal effect u 
on quality of performance. Since the Was of borderline significance- 
primary focus of the training was оп be recalled that the children a as 
inhibition of a fast response rather trained by an adult the S 5 to 
than improved strategies of visual themselves, but it was only aa in 
detection, it is reasonable that response use female adults as the ase dition f 
latency should be affected more the final evaluation. This ey an ate 
dramatically than recognition errors. testing should lead, a priori, 


ced 


o the 


ining 
The differential effect of trí 
nder high or low perceived sin?! ill 
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tenuation of the effect among boys 
and a maximization of the effect for 
girls, since the girls were trained and 
tested by an adult of the same sex. The 
results are congruent with this expecta- 
tion, for the effects of high or low per- 
ceived similarity during training were 
more dramatic for girls than for boys on 
both tests. The mean percent increase 
in response time on MFF was 460% 
for Group I-Id girls, in contrast to 
237% for Group I-non Id girls. Al- 
though these means were not signifi- 
cantly different, the girls at either 
extreme supported the major hypothe- 
sis. Of the total of 20 impulsive girls in 
Groups I-Id and I-non Id, 4 of the 5 
who manifested the greatest increase 
in response time after training were in 
Group I-Id; whereas 4 of the 5 who 
decreased the most in response time 
Were in Group I-non Id. This 4-1-1—4 
distribution yielded a one-tailed р 
value of .103 by the Exact Test. The 
magnitudes of response time increases 
Shown by these four girls in Group 
-Id also furnish some support for the 
hypothesis. The actual change scores 
in Group I-Id were as follows: from 4 
to 80 seconds, from 5 to 58 seconds, 
from 6 to 31 seconds, and from 6 to 
29 seconds. 

In sum, the brief training (about 60 
Minutes total) in delay produced larger 
Tesponse latencies among impulsive 
children. Unfortunately, the training 
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did not have a strong effect on error 
scores and did not generalize to the 
inductive reasoning test. Since the 
adult who administered the posttrain- 
ing МЕЕ was not the trainer, the 
results cannot be attributed to a desire 
to please the trainer. The tendency to 
delay did generalize to another adult. 
The study demonstrates that impulsive 
children can be taught to modify their 
behavior and the effect can last a few 
weeks. The hypothesis concerning the 
facilitating effect of perceived similar- 
ity did not receive convincing support. 
The effect was manifest for only a few 
of the girls. However, the authors 
believe that these minimal trends war- 
rant continued study of the psychologi- 
cal significance of perceived similarity 
between tutor and tutee. 
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RELATIONS AMONG PHYSI 


CAL SKILLS AND PERSONAL 


AND SOCIAL VARIABLES IN THREE CULTURES 
OF ADOLESCENT GIRLS: 


BOYD R. McCANDLESS 
Emory University 


A 


ols, budd, 
ance, school grades, 
dom from lying scores 
girls, 2 in religious sex- 


de co; 


acceptance; academic aptitu 

school grades; and social accepta: 

The present study was designed to 
test the hypothesis that physical 
skills have different values for girls 
11 sex-segregated schools from differ- 
ent cultures than they have in coedu- 
cational schools, 
The authors have b 


een able to find 
few data bear 


ing directly on this 
Problem, either for United States 
adolescent girls or for girls of any 
other nationality. However, the con- 
Sensus of the sparse research for 
adolescent girls Suggests that profi- 
ciency in sports and games has, if 
any, only low positive relationships 
within groups of girls to such fac. 
tors as their intelligence, their aca- 
demic achievement, or their popular- 
ity. 
"Kuhlen and Lee (1943) present data 
showing percentages of high-popular 
and low-popular girls possessing cer- 


‘The authors appreciate the cooperation 
extended to them by the administrations, 
staffs, and students of Lady Maclagan Col- 
lege, Lahore, West Pakistan, and Our Lady 
of Grace and University Schools, Indian- 


apolis and Bloomington, Indiana, respec- 
tively. 


ND FAHMIDA ALI 
Lady Maclagan College, Lahore, 


West Pakistan 


= 60; and U.S. coeducational mixed re- 
aptitude test scores were available for 
samples. 10 of the 14 possible correlations 
acceptance, academic achievement, school 


rrelated positivel 


; social acceptance— 
nce—physical skills. 


tain physical skills, attributed to pe 
by "Buddy Ratings." Bretsch (1 o 
gives the percentage distribue, fe 
high and low popularity quartile ni i 
grade girls who rated thomatives Ss, 
participation in and/or skill in en he 
This is the only physical „ski ally) 
reports that (impressionistica ^ 
Seems similar to badminton, one 


Д in 
the skills used for Pakistani oem 
the present sample. All gu ше 
(Ss) in the present study jU 


others in their class by the buddy” 
rating technique, but did not рй 
on themselves, Hence, the resulte i 
ported by Bretsch are not sd 
ble to the present, sample. How int 
those of his girls (N = 371) reponi. 
themselves as participating in „ than 
were significantly more popular ; not 
those who reported that they К a ction 
play tennis. This may be a A gyn- 
of the ‘general extracurriculal or 0 
drome — popularity вушїгош тшу, 
the social-class-relatedness el tively 
ing tennis (i.e. tennis is а "arity 8 
upper-class sport, and ОР lak sta- 
positively related to social-clas 
tus). 
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" Kuhlen and Lee, using the category, 
Active in Games," as rated by bud- 
dies for ninth-grade girls, find Ss high 
in popularity (57%) “Active in 
Games," while those low in popu- 
larity are not (1795). This suggests a 
low positive relation, but the meth- 
Ods are so different from those used 
in the present study that no meaning- 
ful comparison can be made. 

Somewhat more direct evidence 
exists for a positive relation between 
intelligence and social acceptability. 
Thorpe (1955) and Damrin (1949), 
respectively, report for samples of 
British children averaging about a 
year younger than the present sam- 
ple, and of American girls who aver- 
age about a year older. A number of 
Subsample correlations are reported, 
which for the British sample range 
from —.35 to +.53, with an estimated 
Overall correlation of .15; and for the 
U.S. (Damrin) sample .31. However, 

amrin’s sample ranged in age from 
Grades 9 through 12, the median age 
being about 16, while the present sam- 
ble is homogeneous in grade level. 


METHOD 


Subjects 


In the present study, the procedure for 
gathering data from the three different 
amples was as similar as possible. Details 

Te given for the methods of collecting the 
gist set of data. Differences in procedure 
i e mentioned when they occurred for the 
econd and third sets of data. The first. 
аре (Un) was composed of middle- and 
,Dper-middle-class ninth-grade girls in a 
wee ducational public school in а Midwestern 
an Versity town having a population of 
t out 40,000. Most of the girls were Protes- 
“tnt. The second set of data (Ca) was 
nithered from urban middle- and upper- 
mrdle-class ninth-grade girls enrolled in an 

ап Catholic sex-segregated private school, 
dare most of the students were day stu- 
ans These girls come from a city cultur- 

Y very similar to that of the Un girls. The 
WO towns are about 50 miles apart, and 

uch intercommunication exists between 


them. The third sample was made up of 
middle- and upper-middle-class ninth-grade 
urban Muslim Pakistani (Pa) girls in a sex- 
segregated school supported by a combina- 
tion of private and public funds. The Pa 
sample consisted entirely of day students and 
came, of course, from a culture extremely 
different from the Un and Ca samples. ` 


Ргосейите 


All girls in all samples had known each 
other for at least that portion of the aca- 
demie year preceding the gathering of data 
(about 8 months). Data for the Un and Pa 
girls were gathered by the women physical 
education teachers in the respective schools 
(the junior author for the Pa sample). The 
Sister-Principal and the senior author 
worked cooperatively in gathering data for 
the Ca sample. Almost all girls were 14 or 
15 years old, and were estimated to range 
from average to superior in intelligence. The 
Results section gives intelligence test data 
for those girls for whom they were available. 

Not all girls took all tests, so the data 
are reported for a maximum number of 52 
girls for the Un sample and 60 girls each 
for the Ca and Pa samples. Where relevant, 
numbers are indicated in all cases in the 
tables or text. 

The following data were gathered, con- 
fidentiality being assured and facilities pro- 
vided for the girls to work in seclusion: 

1. Popularity: Each girl rated every 
other girl in her class from 1 (one of my 
best friends) to 5 (not my friend at all). 
The most popular girl in the Ca and Pa 
samples could thus achieve a score of 59 (if 
every other girl ranked her 1) and the 
least popular 295. F'or the Un sample, these 
figures are 51 and 255. 

2. Academie achievement: Average school 
marks in the “solid” school subjects for the 
first semester of the ninth-grade academic 
year were used as the criterion for the Un 
and Ca samples (a grade of A was weighted 
40; F, 0). Ninth-class middle-year marks, 
based on a Government devised, nationally 
uniform, "external" examination over the 
solid subjects were used to determine aca- 
demie achievement for the Pa sample. 

3. Physieal skills: Skill at volleyball, 
tennis, broad jump, and sprint formed the 
buddy-ratings used for the Un and Ca 
samples. Net ball (similar to volleyball) and 
badminton (tennis was not available to the 
Pa sample, although the game is widely 
played in Pakistan by men and boys) were 
used rather than volleyball and tennis for Pa 
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ach of these activities was rated 
ve addio from 1 (a “best player") 
through 4 (below-average player”) and 5 
(“never plays at this game”). The final pro- 
ficiency score was the sum of these ratings 
for the four activities. For the Pa sample, 
phi between the physical education teacher 
and the average buddy-rating was .70. 
Buddy-ratings are used as the measure of 
physical skills for each sample. 

For 24 of the Un girls (but for none of 
the other samples), actual Measures (of 
inknown reliability) existed for proficiency 
n dash, broad jump, and “composite physi- 
tal skills.” Buddy-ratings correlated 69 for 
dash versus performance, .73 for broad jum 
versus actual performance; and .61 for “com- 
posite physical skills” versus performance, 
None of these correlations is corrected for 
attenuation, although each is Significant 
below the .01 level. 


4. Anxiety and Li 


е (here referred to as 
Freedom from Lyin; 


g and probably actually 
") scores were ob- 


differ in English. 


5. Scholastic Aptitude: Commonly used, 
standardized tests of academic aptitude (in- 
telligence) Were available for a majority of 
the Un and Ca samples, more complete 


n girls. A semi- 


ment: These were availa 
of the Un sample, about half of t 
sample, and none of the Pa sample, 
The entire set of data was collected for 
the Ca sample in 1 day, and within an 
academic week for the other two samples, 


RzsurTS 


The results of interest, in this study 
lie principally not in norms, but in 
relations among the different vari- 
ables. Table 1 presents intercorrela- 
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tions separately for the Un, Ca, and 
Pa populations. Р 
Of ‘the 14 first-order een 
Tun for all three samples, 10 were a 
Mogeneous (Snedecor, 1946), шше 
ing substantial similarity Ele 
groups. Homogeneity, to begin the ( ne 
ing with the Academic Aptitude oa 
telligence) variable, existed for its a 
lations with school grades (r's rang for 
from .58 for the Pa sample to 71 an 
the Ca sample, each r being signific s 
at the .01 level or less). Its ee 
strated relations with the r^ 
measure were also homogeneous, 18 
ranging from —.15 for the Ca icm 
to —.32 for the Pa sample. These е9 
tive r's were statistically significan Pa 
the .05 level or less only for the 
sample. i 
Intelligence and Freedom from pa 
ing also showed homogeneous, 
ranging from .08 for the Ca gir$ 5 
88 for the Un sample, significante, 
-05 level or less existing only for 
last-named sample. S 
The sociometrie variable ( € 
Acceptance) was homogeneous s 
essentially zero in its relations also 
Anxiety. Social Acceptance ‚ЖАЗ ih 
homogeneous in its correlations yos 
reedom from Lying, positive Y in. 
lated for each sample, but Pd 
cantly so (at the .01 level or ME 
Only for the Pa sample, wher 
reached .43. titer? 
School Grades followed the p ene- 
for Academic Aptitude: Homon þe- 
ous but nonsignificant r’s m sjca 
tween composite marks and POT n 
Skills for all three samples i for 
from .01 for the Un sample to - 


ocial 


x (the 
the Pa group); and давы, gero 
Tange of 4's was from essentia tisti- 


for the Un and Pa girls to a x 
cally significant —.36 for Freedom 
group). School Grades and Doe same 
from Lying followed much the C? 
pattern: essentially zero for 
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TABLE 1 
CORRELATIONS FOR THE UNIVERSITY SCHOOL, CATHOLIC, AND PAKISTANI 
SAMPLES AMONG THE DIFFERENT VARIABLES 
ial | Achi ical " 
acie | eat | inde | aD | aies | iiir 
Academic aptitude 
University N=51|N=46| N=50| N=51) N=51 | N— 51 
.39** „169% .63** 16 —.25 .38** 
Catholic N =42| N=30| N = 42 | N= 42 | N = 37 | N = 37 
.04 .62** Se —.06 —.15 .08 
Pakistani N = 45 N=45|N=45|N=46|N=4 
.56** .58** 41** —.92* 928 
Social acceptance 
University N=47| N=51 | N =52| N = 52 | N = 52 
E .40** .28* —.02 10 
Catholic N-30| N 260 | N=60|N=55 | N = 55 
.82 .30* .66** | —.19 mí à 
Paki : r-60|]N260| N=60| N = 6 
Pakistani pl "pgs 09 43** 
Achievement test 
i i - Г = 47 | М№= 47 | N=47|N = 47 
University p) Fw | on | mail .31* 
'atholi i = N = 30 | N=30| N 228 | N=28 
аниа d лон | a5 | —.08 13 
Pakistani 
School grades N=681|N=51| N=51 
niversity 01 02 IT 
" = y= 55 | N = 55 
Catholic ш a ы .36** | —.02 
" s - N = 60 
Pakistani N = Е rhe .35** 
Physical skills у= 5 | N = 8 
niversity .04 .02 
= 55 | N = 55 
Catholie A .21 .07 
Г = 60 | N = 60 
Pakistani и .00 .82* 
Anxiety І М = 52 
niversity .29* 
Y N = 55 
Catholic .85** 
T N = 60 
Pakistani 04 


" 
"LESE 
P < .01. 


370 


and Un samples, and a statistically 
Significant (p « .01) .35 for the Pa 
Г! . . 
Ё Sdn between Physieal Skills 
and Anxiety were homogeneous 
among the three samples, and in no 
case did the relation differ signifi- 
cantly from zero. The same relations 
held for Freedom from Lying and 
Physical Skills, except that the cor- 
relation for the Pa sample reached a 
.05 level of significance of .32. 
Standardized academic achieve- 
ment tests were available only for 
the Un and Ca samples. These rela- 
tions were homogeneous for all vari- 
ables: Academic Aptitude, positive 
and significant at below the .01 level 
for both samples: Social Acceptance, 
nonsignificant for both samples; 
School Grades, Positive and signifi- 


respectively ; 
nonsignificant for 
xiety, nonsignificant 
; and Freedom from 
d significant at the 
n sample, nonsig- 
e for the Ca sam. 


nd the 
ptance, 
mogeneous correlations 
should reveal cross-cultural differ- 
ences, if such exist in accordance with 
the authors’ speculations. 

The first, Academic Aptitude anq 
Social Acceptance (popularity) lends 
itself poorly to plausible speculation 
(relations were positive and signifi- 
cant at below the .01 level for the 
Un and Pa samples, but approached 
Zero for the Ca sample). It might be 
Suggested that the Un and the very 
highly selected Pa samples value 
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brains in the social and possibly mari- 
tal market place, but that чаш 
Aptitude plays a less important T : 
for well-protected, middle- “Te Pii 
per-middle-class, sex-segregated а в 
olie girls who do not, as the Un um 
do, have to compete directly € io 
ог, as in the case of the Pa gi! rity 
prove themselves (as a true mamon 

group) equal to or better than e di 

The second, Academic Apti а 
and Physical Skills, approaches zs 
for the Un and Ca samples y 
positive and significant at less This 
the .01 level for the Pa sample. 1 is 
Suggests that the Pakistani ШИ ей 
trying to demonstrate lr she 
hold her place among males. 

The third set of need nee 
Social Acceptance and School Gra the 
was positive and significant - s 
05 level for all three samples. vni 
ever, the Ca sample showed a ihe pa 
cantly lower correlation than Lat n 
sample (.30 and .64 respectively ү 
nificantly different at the .02 me 
and the Un sample and Pa sa ap" 
where the difference closely level 
proached significance at the 05 ely): 
(rs were 40 and .64 respective e: 
This pattern of relations is ae adi 
To a greater degree for the we 
than for the American girl, educ re- 
means status; suecess in school ДЕ ПЕ 
sents emancipation and, perhaps, 7 
cessful competition with the cri 
Hence, it is more valued as а han 
terion for social acceptability ла] 
for United States girls. Ма! tures 
modeling may also enter the T'rdless 
as there is a strong trend, regà those 
of fathers’ social class, for on пей 
Pa girls whose mothers are e 
to reach ninth grade themselves, у 

Fourth, Social Acceptance n physi- 
and positively correlated wit ge 
cal Skills for both sex-sesregt oy 
samples (.66 for the Ca an reaches 
the Pa sample), but barely 
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significance at the .05 level (r = .28) 
for the coeducational (Un) sample. 
The difference between Un and Ca 
girls is significant at the .01 level, and 
almost reaches .05 significance for the 
Un and Pa samples. In a girls’ world, 
does athletic ability (a “masculine” 
trait) lend status, while it is of less 
importance in a coeducational world 
Where the competition against others 
of the same sex may be greater for 
the attention of boys than in the area 
of physical skills? 

In no case were these correlation 
Patterns altered when partial correla- 
tions were run, indicating that the 
Pattern of relations is not а function 
of a halo effect. 


Discussion 
Striking sample - intercorrelation 
Similarities exist among the three 
samples of U.S. University School 
hinth-grade girls (Un), Roman Cath- 
olie parochial school, sex-segregated 
Ninth-grade girls (Ca), and Muslim 
Sex-segregated ninth-grade Pakistani 
School girls (Pa). This suggests that 
translation of research instruments 
Yom one to another language (Urdu 
or the Pa sample), and their use for 
Populations from very different school 
Settings may be quite feasible. For 
example, a measure of Academic Ap- 


titude (intelligence) and school 
Marks in the so-called solid subjects 
Showed homogeneous correlations 


"mong the three groups, uniformly 
gh and positive (Un = 63, Ca = 
1 1, and Ра = .58). From a total of 
4 correlations between different vari- 
ables caleulated for all three cultures 

cademie Aptitude, Social Accept- 
ance, School Grades in the solid sub- 
Ieets, an index of Physical Skills, 
«dxiety, and Freedom from Lying— 
8 ack of Defensiveness’), only four 
ets of correlations were nonhomoge- 
ous, 


For the two United States samples, 
standardized tests of academic 
achievement were also available. 
Correlations between scores and all 
other variables mentioned in the pre- 
ceding paragraph were homogeneous 
for the Un and the Ca samples, rang- 
ing from high positive for Academic 
Achievement Tests versus School 
Grades (approaching r = .80) to al- 
most zero (Academic Achievement 
Test versus Sociometric Status, Phys- 
ical Skills, Anxiety, and Freedom 
from Lying). 

These results are pleasantly sur- 
prising from a research worker’s point 
of view when one considers the great 
cultural range. The Pa sample came, 
for example, from a Muslim culture, 
predominantly “purdah”—that is, 
veiled, exposed after childhood only 
to males from the closest family 
circle; was administered Government 
composed examinations, with no 
teacher influence on grades; and spoke 
English and/or were exposed to West- 
ern norms little or not at all. Resi- 
dence was in a great city of more 
than one million population. In con- 
trast, the Un sample was enrolled in 
an experimental, demonstration co- 
educational school in a small Mid- 
western, university-centered town; 
and the Ca sample was sex-segre- 
gated and resided in a moderately 
urban area (about one-half million 
residents). 

The nonhomogeneous correlations 
are of particular interest to the au- 
thors. They numbered four: 

1. Academic Aptitude and Social 
Acceptance, significantly positive for 
the Un and Pa samples, essentially 
zero for the Ca sample. This suggests 
that the coeducational University 
School community imbues intelli- 
gence with social desirability, the 
Pakistani-Muslim community re- 
gards it as a means whereby “women 
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may claim their own,” and the Cath- 
olie girls’ community views it rela- 
tively neutrally. 

2. Academic Aptitude and Physical 
Skills correlated about zero for the 
United States samples, but signifi- 
cantly positively for the Pakistani 
sample. The brighter, relatively sup- 
pressed Pakistani girl may believe 
that, to succeed, she must excel in 
everything and, in her deeply sex- 
segregated culture, be in competition 
with the male, fighting him for status, 

3. The Pakistani girls may also be 


struggling, along with the coeduca- 


tional girls, for intellectual status, and 
appreciating this a 


: iving. 
- 40 а unisexual eultu 
Catholie and Pakistani И es 
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tance to the coeducationally soared 
girl, who may value the attention [) 
boys more than her success in the 
motor area. 
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With the aim of determining the effects of teachers' belief systems 
upon the classroom atmospheres they create for preschool children, 
30 teachers, 10 in each of 3 levels of concreteness-abstractness, were 
observed and rated by pairs of independent judges on 26 dimensions 
relating to such factors as flexibility, attitudes toward rules, encourage- 
ment of the children's independence and creativity, need for structure, 


and punitiveness. The m 


concrete Ss in what is presumed to be 


direction on all dimensions. 


ore abstract teachers differed from the more 


an educationally favorable 


Teachers differing in the concrete- 
hess-abstractness of their belief or 
Personality systems should also dif- 
er in the goals, social milieu, and be- 

avior they seek for and from their 
Students and hence, knowingly or un- 

Nowingly, in the values and content 

ey communicate, in their styles of 
Communication and in their reactions 
to student adherence to or departure 
rom the standards embodied in their 

eliefs, Because concreteness disposes 
towards more fixed and categorical 

elief systems (Harvey, 1966; White 
& Harvey, 1965), toward an author- 
ity orientation rather than а task 
Orientation (Harvey, 1966; Tiemann, 
— 

"This study was supported by the Head 


ls fart Program, Office of Economic Oppor- 
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ean? to the many observers for their con- 
lentious effort in collecting the data and 
t the teachers in and directors of Head 
thon t Programs that permitted us to visit 
eir schools and observe their teaching. 


1965), and toward seeking a simple 
and highly structured environment 
(Harvey, 1965; Ware & Harvey, in 
press), more concrete teachers, to a 
greater extent than more abstract 
teachers, should impose predetermined 
goals upon the students, provide struc- 
ture and detailed means for their 
attainment, be less tolerant of student 
deviation from their goals and stand- 
ards, and, consequently, react more 
strongly and invariantly to such de- 
viation. A test of some of the more 
specific aspects of this general hy- 
pothesis was the purpose of this study. 


METHOD 


Head Start teachers differing in concrete- 
ness-abstractness according to two different 
ures were rated on 26 dimensions as- 
sumed to reflect educationally desirable and 
undesirable behavior toward their preschool 
students. The rating categories were (1) ex- 
pression of warmth toward the children, (2) 
perceptiveness of the children’s wishes and 
needs, (3) flexibility in meeting the needs 
and interest of the children, (4) ability to 
maintain relaxed relationships with the chil- 
dren, (5) attention to the individual child 
(6) task involvement, (7) enjoyment ot 
teaching, (8) enlistment of child participa- 
tion, (9) encouragement of individual re- 


meas! 


373 


374 


ibility, (10) encouragement of free ex- 
ы of feelings, (a1) encouragement of 
creativity, (12) teaching new concepts, (13) 
ingenuity in improvising teaching and play 
materials, (14) utilization of physical re- 
sources, (15) task effectiveness, (16) diver- 
sity of activities simultaneously permitted, 
(17) smoothness of classroom operation (es- 
pecially in the transition from one activity 
to another), (18) consistency of rule enforce- 
ment, (19) use of functional explanation. of 
rules, (20) use of nonfunctional explanation 
of rules, (21) use of unexplained rules, (22) 
rule orientation, (23) determination of class- 
room and playground procedure, (24) need 
for structure in teaching activities and rela- 
tionships with children, (25) punitiveness, 
and (26) anxiety induced by the observers’ 
presence. 
It was anticipated that the more abstract 
subjects (Ss) would score higher than the 


more concrete Ss on dimensions 1-19 and 
lower than more concrete 


Ss on dimensions 
20-26. 


Subject Selection 


Teachers wore selected for subsequent ob- 
servation on the basis of their responses to 
the “This I Believe" (TIB) Test scored ac- 
cording to conceptual systems. 

The TIB, developed Specifically as a 
Ineasure of conceptual or belief systems (e.g., 
Harvey, 1964, 1965, 1966; White & Harvey, 

965), Tequests S to indicate his beliefs 
about a number of socially and personally 
significant concept referents by completing 
1n two or three sentences the phrase, “This 
I believe about —  — — ," the blank be- 
ing replaced successively by one of the ref- 
erents. The referents employed in the pres- 
ent study were comprised of “religion,” 
“friendship,” “the American way of life,” 
“sin,” “education,” “the family,” “people on 
welfare,” “punishment,” “teaching,” and 
“sex”; the last six substituted for the more 
standard referents with the aim of insuring 
their specific relevance to the Ss and to the 
task of teaching in the Head Start Program, 

From the relativism, tautologicalness, 
novelty, and connotative implications or 
richness of the completions, together with 
criteria implied below, respondents may be 
classified into one of the four principal be- 
lief systems posited by Harvey, Hunt, and 
Schroder (1961) or into some admixture of 
two or more systems. 

More specifically, Ss were classified as 

representing predominantly System 1, the 
most concrete mode of dimensionalizing and 


construing the world, if their sentence com- 
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pletions denote such characteristics a pos 
absolutism, high tautologicalness, hig iater 
quency of platitudes and normative 5 sity 
ments, high ethnocentrism, high feno 
assertion of the superiority of Ame 
morality, and expression of highly pos! 
attitudes toward institutional referents. ting 
The Ss were categorized as e е 
System 2, the next to the lowest level hi hly 
stractness if, in addition to being. dum 
evaluative and absolutistie, they estere 
strong negative attitudes toward such rican 
ents as marriage, religion, and the Ame hic 
way of life, the same referents toward + hly 
System 1 representatives manifest high” 
positive attitudes. - 
Responses to the TIB were scored a 
resenting System 3 functioning, the ted 
the highest level of abstractness P more 
Harvey et al. (1961), if they indicate Sys- 
relativism and less evaluativeness ша 
tems 1 and 2 and at the same time ӨЛР ships 
strongly positive beliefs about frien 
people, and interpersonal relations. of the 
System 4 functioning, the highest ted by 
four levels of abstractness, was indica ree of 
TIB responses that implied a high om 
novelty and appropriateness, indepe d соп“ 
without negativism, high relativism a usage 
tingency of thought, and the gener еп” 
of multidimensional rather than unl 
sional interpretive categories. 
The TIB was administered 
teachers participating in the Hea түе 
training program conducted by the i 
sity of Colorado Extension Рамашова 
the summer of 1965. From among T n 
ber, no instances of System 2, the ases 
thority orientation, and only 10 “Ten Ss 
System 4 functioning were found. from ê 
each from System 1 and 3, prp ер” 
somewhat larger subpopulation af o ре 
tatives of these systems, were adde 
10 System 4 representatives, for 2 
30 experimental Ss, all women. 


um” 
j-nu- 
n af 


total n 


crete 
An Additional Measure of Со" 


ness- Abstractness pleted 


ate 
In addition to the TIB, Ss Com pstrac 
objective measure of concreteness cst), 
ness, the Conceptual Systems wean dus 
Which was developed through y Bailey’ 
ter and factor analysis UO which а 
1965*). Four factors were foun 7 
aav, “The <, 
* R. C. Tryon and D. E. Bailey, | ys" 
C. Try system of cluster and шо in pre! 
Multivariate behavioral researc, 
aration. 
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theoretically consistent with the major char- 
acteristics of the four principal conceptual 
systems or levels of concreteness-abstractness 
Posited by Harvey et al. (1961). These fac- 
tors, as we have tentatively labeled them, 
together with some of their representative 
items, are: 

1. Divine fate control or religious funda- 
mentalism. This is assessed by such items as 
There are some things which God will 
hever permit man to know,” “In the final 
analysis, events in the world will be in line 
With the master plan of God,” and “I be- 
lieve that to attain my goals it is only nec- 
ee for me to live as God would have me 

27 

2. Need for certainty or simplicity is ex- 
Pressed in response to such statements as “1 
Prefer a story that has two themes rather 
than one that has five or six themes going at 
Once,” “People who seem unsure and uncer- 
tain about things make me feel uncomforta- 
le,” and “The effective person is one who 
does not hold conflicting beliefs.” 

3. Tolerance of complexity and uncer- 
tainty is based on such items as “I have so 
much trouble finding out what is or is not 
true that I can’t understand how some 
People can feel so certain that they know 
the truth,” “More often than not, I like 
Some aspects of a person and do not like 
Other aspects of him," and "I find that I 
Cannot help analyzing almost everything I 
See and hear." 

4. Relativism of truth is measured by 
Such items as “There can be as many truths 
АЗ there are individual points of view," 

an is the judge of the truth or untruth 
9f his thoughts and behavior," and “Some- 
ling is true or untrue depending on one's 
assumptions and the context." 
а Of the several ways these factors may Бе 
Cored, the one utilized for this report was 

€ overall abstractness score, represented 
Y the sum of the item responses on а six- 
Point scale (from “Completely disagree” to 
ogmpletely agree”) across the four fac- 

TS, 


Subjects 


3 Ten representatives each of System 1, 
› and 4, as selected by the TIB, served as 
he experimental Ss. The mean score of 
oe system on the CST, the second meas- 
we of abstraetness, was: System 1, 2.98; 
System 3, 343; and System 4, 435. Ac- 
сота to 2 tests, each system differed sig- 
ificantly from each of the others (р < 05 


o : 
Т all comparisons). 
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All Ss had had prior teaching experience, 
System 1 teachers having taught on the 
average almost twice as long (10.1 years) 
as teachers from System 3 (5.8 years) or 
System 4 (5.4 years). However, only one 
System 1, two System 3 and three System 
4 teachers had taught previously at the pre- 
school level. All Ss participated in a com- 
mon l-week training program for Head 
Start teachers shortly before beginning their 
teaching of Head Start enrollees. 

The geographic locations where the Ss 
were teaching and were observed ranged 
from urban housing projects to small towns 
which served predominantly rural children 
in their Head Start programs. In an at- 
tempt to control for the influence of dif- 
ferent administrative structures and physi- 
cal facilities upon the teachers’ classroom 
behavior, an equal number of teachers of 
each belief system was selected from a com- 
mon administration and a common build- 
ing whenever possible. In the three in- 
stances in which representatives of the 
three belief systems could not be obtained 
from the same program, a teacher was se- 
lected from an alternate program that 
served children comparable in socioeco- 
nomic backgrounds to the children being 
taught by teachers of the other two belief 


systems. 


The Rating Scale 


Teachers were rated on the 26 behavioral 
dimensions (listed earlier) which had been 
selected primarily to reflect differences in 
the extent to which the teacher fostered in- 
dependence, creativity, diversity of in- 
terests, enjoyment, and intrinsic motiva- 
tion among her students, ends which would 
be desirable among most educators of to- 
day. 

Each of the behavioral dimensions was 
rated on a six-point scale: 3, 2, and 1 for 
“far,” “considerably,” and “slightly” above 
average respectively ; and —1, —2, and —3 
for "slightly," "considerably," and “far” 
below average respectively. The "average" 
category was omitted with the aim (by 
creating a forced choice condition) of 
avoiding the common tendency of obsery- 
ers (Os) to assign a wide variety of dis- 
criminably different behaviors to this cate- 
gory. Through a training program described 
below, an attempt was made to establish 
equivalent "averages" for all Os. 

To encourage specificity of ratings, the 
rating scale provided space for each be- 
havioral dimension to be rated under each 
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f the school activities of free play, directed 
play, reading and/or story telling, formal 
instruction, outdoor activity, snack „time, 
and cleanup activity. Owing to variation in 
their schedules and practices, many teachers 
were rated in relation to different activities, 
no teacher being rated for all activities. In 
addition to each S being rated on the 26 
dimensions for as many of the activities as 
occurred during the observation period, she 
was also assigned an overall rating on each 
dimension by each O. An overall rating, in- 
stead of being the arithmetic average of the 
multiple ratings of a dimension under the 
different activities, represented the overall 
impression of an O for thnt dimension 
based on his interpretation of his specific 
ratings together with unrated behavior he 
may have observed or other impressions he 
may have gathered. In some instances the 
overall rating of a dimension would closely 
parallel the arithmetic average of ratings 
on that dimension and in other instances it 


would not. All statistical analyses were 
based on the overall ratings. 


Interjudge reliability for th 
assessed at two points 
lowing the last training 
ately after completion of the experimental 
observations, 3 weeks later, The mean of the 
correlations between the ratings of every 
pair of judges, based both times On obser- 
vations of a System 3 teacher, was 69 and 
68 for the earlier and later time respec- 
tively, values which, while lower than the 
ideal, are as high as could be expected given 
the number and diversity of Os. 


r the 14 Os was 
» immediately fo]. 
Session and immedi- 


Procedure 


Each S was observed for approximately 
2% hours by two Os on two occasions 1 
week apart. The administrator of each Head 
Start program was asked to give the teach- 
ers advance notice of the dates they were 
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to be observed; this was done in all but two 
instances. 

The Os, in pairs, arrived before oe за 
gan, introduced themselves, SEP aD: 
(with the aim of allaying the tanaher ae 
prehension and fostering her pe apie а: 
that the purpose of their visit was to & hich 
examples of good teaching procedures W. ent 
could be utilized as bases for Dpto Ea 
training programs for Head Start kp abe 
and requested that they be allowe of 
serve but to remain as unobtrusive M Кыт 
sible in order to minimize the effects o Os! 
presence upon the children, To further 
unobtrusiveness and simultaneously og 
increase the likelihood of an $ fo pie 
her planned program and, in 50 doing, Ог; 
ting somewhat typical teaching be eon 
each S was asked specifically not to riod. 
verse with the Os during the class p үз 

The observations took place йш on 
mal classroom and playground activi i Я 
days free of special events in order to Г arie 
the conditions of observations as cont ort 
ble as possible for all Ss. As a further o 
toward maximizing the comparabi T ine 
the judgments, each pair of Ов. ity at 
structed to remain in the same buone the 
all times so that both would witne 
same behavior of a teacher. . Os in 

During the observation period 26 di- 
dependently rated S on all of the s that 
mensions under each of the ceu the 
occurred during the period. Boer rated 
observation period, Os independent! Loch of 
their overall impression of the 5 on s - 
the behavioral categories. Ratings wet" each 
Sequently compared and discussed ditio? 
pair of Os and under the single ER due 
Where disagreement, was found to owt fo 
to different, usage of the rating өдө, Ад ake 
no other reason, either O was free of this 
changes in his ratings. The purpose insure 
rather unorthodox procedure was tegorie? 
that Os were defining the rating © contro 
in similar ways and in so doing за ye" 
for the variance that would othe е. This 
sult from difference in scale um two 
Procedure was based on ANGIE i rvey 
earlier studies (Harvey, 1963; , AT eviews 
Kline, 1965) that postobservation counter” 
of their ratings by Os effectively ay from 
acted their tendencies to drift A of time 
the original criteria with the past o То 
and the accumulation of бнрн observe 
avoid biasing the Os who were б sesion 
the S on the second boom 0 the P9?! 
concerning any S was restri 3 
of Os wha Бай observed her tenet ed and 

Any О bias was further con! 
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"id Sened experimental controls achieved 
with gh the policies of no O being paired 
» апу other О more than once and 
rds eing observed by the same O more 
of р 096, Any S bias resulting from spread 
teach Ormation about the study from one 
ne to another was minimized by the 
ait ure of observing on the same day 
d eachers who were within the same 
ool system. 
ue Os made their observations and rat- 
ee any knowledge of the system 
Pe fication of the teacher. Moreover, in 
pat, ? prevent the Os from establishing 
Жел, ar sets and expectancies from re- 
Шы Observation of Ss from any one be- 
mn tem, each O observed a different 
thr m order of representatives of the 
ee belief groupings. 


RESULTS 


Relatio B а à 
nships among the Rating Di- 
mensions á ш " 


M relationships among the rating 
Veco ins were determined by а 
Bait cluster analysis (Tryon & 
zo 1965, 1966) based on the 
26 med overall ratings of each of the 
an imensions across the 30 teachers 
Cory, the four judges. Interdimension 
relations ranged from .86 to —.90, 
терага, median correlation, without 
320819 to sign, of .53. Fifty-five of the 
* Ms attained a magnitude of 
9f th — .70 or above and all but one 
itte, rel were accounted for by the 
e its atedness among 15 items which 
€d into one of two major clusters. 
termed first cluster, which may be 
of 184 dictatorialness, was comprise 
facto rating items. They and their 
tur.” loadings were: need for struc- 
Orion, 97), flexibility (—91), rule 
fre, ation (.89), encouragement 0 
deten Pression of feelings (—-86), 
ot Mation of procedure (.86), use 
hes," "Xplained rules (.84), punitive- 


Чу ( bae , encouragement of creativ- 
àctiyi O), diversity of simultaneous 
fing CS (—.77) and encouragemen 


The "idual responsibility (—-71)- 
Second cluster, which centered 
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around task orientation, consisted of 
five items with factor loadings as in- 
dicated: warmth (.89), perceptive- 
ness (.88), task effectiveness (.77) 
utilization of physical resources (72), 
and ingenuity in improvising teach- 
ing and play materials (.65). 

With the exception of the correla- 
tion between anxiety induced by 
judges’ presence and relaxed relation- 
ships with the children, none of the 
correlations among the 11 dimensions 
not included in one of the two major 
factors attained the magnitude of 
+ or — .70. Hence, while more than 
half of the rating items represented 
different ways of tapping either of 
two common behavior patterns, some 
of the items reflected fairly inde- 
pendent teacher behavior. 


System Differences 

Performance on specific dimen- 
sions. According to the hypothesis, 
System 4 Ss, as classified by the 
TIB, should score higher than System 
1 Ss on dimensions 1-19 and lower 
than System 1 teachers on dimensions 
20-26. A corollary hypothesis was 
that System 3 teachers would score 
between Systems 1 and 4 on all 26 
dimensions. 

The mean and s 
of the overall ratings of т 
on each dimension аге presented in 
Table 1. All values are based on the 


ratings of four Os owing to the results 
of a preliminary analysis of variance 
which showed that no significant dif- 

between the ratings 


ferences existed bet з 
of the teachers at Time 1 and Time 2. 
Results in Table 1 show the pre- 


dicted differences between Systems 1 
and 4 on all 26 dimensions. In further 
accord with the predictions, System 3 
Ss scored between Systems 1 and 4 on 
93 of the 26 items. Page’s (1963) test 
for ordered hypotheses for multiple 
treatments, a test for predictions of 


tandard deviation 
each system 
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TABLE 1 - 
ж DT 
MEAN AND STANDARD DEVIATION FOR RATINGS or EACH System AS MEASURED BY 
Tus I BELIEVE TEsT 


System 1 System 3 System 4 
Rating dimension 
ating dime: p E» p Б uf SD 
d. Wasa $55 | 049 | 3.97 | 107 | 4.27 Le 
2. Perceptiveness 3.38 0.71 3.90 0.86 4.15 0.97 
3. Flexibility 3.30 | 0.69 | 3.97 d Hs 0.32 
4. Relaxed 4.13 | 0.68 | 4.50 | 0.6 E се 
5. Attention to individual 3.67 | 0.60 | 4.10 | 0.80 ay | oS 
6. Involvement 3.90 0.78 4.13 0.84 -19 $ 
T 0.97 
7. Enjoyment 3.67 0.51 4.02 0.53 4.13 i 
8. Enlistment child participation | 4/3; 0.85 | 4.25 | 0.87 | 4.32 | 0.79 
9. Encourage individual responsi- 0.94 
bility 3.63 | 0.53 | 4.17 | 0.67 | 4.25 (4 
10. Encourage expression of feelings | 3.47 0.55 3.92 0.65 4.20 б 
11. Encourage creativity 3.15 | 0.49 | 3.65 | оо | 382 «c 
12. Teach new Concepts 3.95 0.78 4.05 0.59 4.32 EE 
13. Ingenuity 3.30 | 0.64 | 3.85 | ogg | 4.27 45 
14. Utilization of resources 3.92 | 0.60 | 415 | ose | 4.50 e 
15. Task effectiveness 3.82 | 0.44 | 4.29 0.48 | 4.35 ani 
15. Diversity of activity 3-52 | 0.69 | 4.00 | 050 | 405 0.65 
17. Smoothness 3-85 | 0.77 | 402 | 053 | 4.15 Eg 
18. Consistency 3.95 | 0.57 | 402 | 055 | 407 029 
do Functional explan, rules 3.38 | 0.66 | 3.85 | 074 | 377 d 
2 Unekinetional explan. 3.02 | 0.71 | 297 | ові | 2.88 04 
22. Rule ined rules 3-90 | 0.77 | 3.30 | 060 | 3.05 en 
23. Dei, entation | 447 | 0.76 | 3.70 | 0,87 | 3.20 0.79 
s ermination of procedure 4.40 0.94 4.25 0.68 3.63 “05 
24. Need for structure 145 | 1.00 | 36 | 0.93 | 3.13 | L 
ve i й : ї Ў 78 
25. Punitivenesg 50 0.7 
25. Auster 3.57 | 0.90 | 2:90 | ова | 2. 0.43 
; y 8-27 | 0.92 | 2.77 | бе | 280 im 
Note—w = 10 for each system, 


order and not of magnitude, sho d 
that the number of qi f 


which the predicted or 
tems was obtained was hi 
cant (М = 26, N = jm 
001). 

While no predietions were made 
concerning the magnitude of differ- 
ences between systems on any specific 
dimension, a test of significance of 
magnitudes would yield additional 
relevant information. One-tailed ғ 
tests between systems on overall rat- 
ings indicated, on the basis of 18 de- 
grees of freedom, that System 1 and 4 
teachers differed significantly on 14 
dimensions (Table 2). System 4 


h 
teachers expressed greater Mame. 
toward the children, showed gre pes 
perceptiveness of the children’s wis in 
and needs, were more flexible the 
meeting the interests and needs e 
Children, maintained more re were 
relationships with the children, re- 
more encouraging of individus... 
Sponsibility, gave greater ee” 
ment to free expression of fen yiti 
Were more encouraging of ere impro" 
displayed greater ingenuity 1n tarials 
Vising teaching and play bot fre- 
invoked unexplained rules у were 
quently, were less rule — and 
less determining of classr rested less 
playground procedure, manifes 
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need for Structure, were less punitive, 
and were less anxious about Os 
presence, 

System 3 teachers differed signifi- 
cantly from System 1 Ss on eight 
dimensions. The former were more 
flexible in meeting the needs and in- 
terests of the children, were more en- 
couraging of individual responsibility, 
Were more task effective, permitted 
greater diversity of activities, invoked 
unexplained rules less frequently, were 
less rule oriented, manifested less 
Deed for structure, and were less puni- 
tive. 

. System 3 and System 4 Ss differed 
Significantly only on one dimension; 
the latter were significantly less deter- 
Mining of the classroom and play- 
&round procedures. 

Dictatorialness and task orienta- 
tion. The differences on the specific 
behavioral dimensions may be sum- 
Marized and highlighted by a between- 
Systems comparison on the two major 
factors extracted from the rating di- 
mensions. With the exception of re- 
axed relationships with the children 
and anxiety from Os’ presence, all of 
the specific dimensions on which Sys- 
tems 1 and 4 differed significantly 
Were contained within either the fac- 
tor of dictatorialness or task orienta- 
tion. 

The scores on dictatorialness were 
3.88, 3.32, and 3.00 for Systems 1, 3, 
and 4 respectively; and 3.62, 4.02, and 
131 on task orientation. System 1 

Cachers were significantly more dic- 
tatorial than representatives of either 
System 3 (t = 217, p « .025) or 
System 4 (2 = 2.92, p < .005) and at 

© same time significantly less task 

°riented than teachers from System 3 

t = 1.73, p < .05) or System 4 (t = 
242, p < 095). There were no signifi- 
"int differences between Systems 3 
“nd 4 on either factor, although Sys- 
tem 3 teachers tended to be somewhat 
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TABLE 2 


t Tests BETWEEN RATINGS ОР Systems as 
MEASURED BY THE Тніѕ I BELIEVE Test 


System | System 


System 
1-3 14 3-4 


Rating dimension 


. Warmth 

. Perceptiveness 

. Flexibility 

Relaxed 

. Attention to individual 

Involvement 

. Enjoyment 

. Enlistment child par- 

ticipation 

. Encourage individual 

responsibility 

10. Encourage expression 
of feelings 

11. Encourage creativity 

12. Teach new concepts 

13. Ingenuity 

14. Utilization of resources 

15. Task effectiveness 

16. Diversity of activity 

17. Smoothness 

18. Consistency 

19. Functional explan. 
rules 

20. Nonfunctional explan. 

21. Unexplained rules 

22. Rule orientation 

23. Determination proce- 


LECT 


ure 
24. Need for structure 
25. Punitiveness 
26. Anxiety 


05 or <.05, 18 df, one-tailed. 
<.025, 18 df, one-tailed. 
р < .005, 18 df, one-tailed. 


more dictatorial and less task ori- 
ented. 


Differences between Levels of Ab- 
stractness Derived from the CST. 


The abstractness scores from the 
CST were divided into three levels of 
High, Middle, and Low, 10 scores in 
each group, and £ tests were computed 
between the means of the two extreme 
groups for each of the 26 teacher be- 
havior dimensions. 

Results in Table 3 show that on all 
but four of the 26 dimensions (enlist- 
ment of child participation, teaching 
new concepts, smoothness of opera- 
tions, and consistency of rule enforce- 
ment) the High-Abstraet teachers 
out-performed the — Low-Abstraet 
teachers. In further support of the 
hypothesis, the Middle-Abstract group 
was rated more favorably than the 
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TABLE 3 


Mean AND STANDARD DEVIATION FOR RATINGS or Low, MIDDLE, AND HIGH ABSTRACT 
TEACHERS AS MEASURED BY THE CONCEPTUAL SYSTEMS TEST 


Rating Dimension 


Low 


Middle High 


E 


о 
© 
У 
a 
S 
к 
чл 
S 


1. Warmth 

2. Perceptiveness 

8. Flexibility 

4. Relaxed 

5. Attention to individual 

6. Involvement 

7. Enjoyment 

8. Enlistment child participation 

9. Encourage individual responsi- 
bility 

10. Encourage expression of feelings 

11. Encourage creativity 

12. Teach new concepts 

13. Ingenuity 

14. Utilization of resources 

15. Task effectiveness 

16. Diversity of activity 

17. Smoothness 


18. Consistency 

19. Functional explanation of rules 
20. Nonfunctional explanation 

21. Unexplained rules 

22. Rule orientation 

23. Determination procedure 

24. Need for Structure 

25. Punitiveness 

26. Anxiety 


M» Q3 ©2 ©з ROW Со 
оо о 
о о o 8268205858 
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Note.—N = 10 for each group. 
Low-Abstraet group, but le = 
ably than the His dnd E 
on 20 of the 26 dimensions. Page's 
(1963) test showed the frequency of 
predicted orderings to be Significant 
(M = 26, N = 3,1, = 341, p < 001) 
While the number of predicted or- 
derings was significant, the magnitude 
of the difference between the three 
CST levels, as assessed by t tests, at- 
tained significance in only five of the 
78 comparisons, a frequency that de- 
parts only slightly from chance. Hence 
none of these ¢ values is reported. 
Abstractness scores from the CST 
correlated .30 (p < .05) with both 
dictatorialness and task orientation, 
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meaning, because of the direction = 
Scoring, that higher abstractness jess | 
companied the tendency to be ted 
dictatorial and more task orie” rt 
in the teaching of the Head St? 
children. 


Discussion 


The results are consisten 
ing that the more abstract teat?” 
differ from the more concrete | "de 
their teaching approaches and B 
classroom atmospheres they Poe 
ated for their Head Start suomi, 
Two important factors, the see?" e- 
process inherent in the cboice 0 the 
coming a Head Start teacher 27 


Я w- 
in sho 
И cher? 


TEACHER PERSONALITY AND SCHOOL ATMOSPHERE 


l-week training program in which all 
teachers participated, should have op- 
erated to produce greater similarities 
In teaching styles and behavior to- 
ward children among our Ss than 
exist among teachers selected ran- 
domly from the different belief sys- 
tems and/or levels of abstractness. 
Despite this, the more abstract teach- 
ers in this study were clearly superior 
to the more concrete teachers in the 
extent to which they produced what 
are presumed to be educationally de- 
Sirable atmospheres in their class- 
rooms. Moreover, this superiority ex- 
isted despite, or possibly because of, 
the greater teaching experience of the 
System 1 Ss. 

We can only conjecture at this 
Point on the differential effect of the 
different, atmospheres, and hence of 
teacher differences in concreteness- 
abstractness, upon the learning and 

ehavior of the children. The an- 

Swer to this, a truly significant educa- 
tional question, can come only from 
Study of the children who have been 
taught by teachers differing in ab- 
Stractness. 

Although both the TIB and CST 
Predicted significantly differential 
Performance in the classroom, the re- 
Sults point clearly to the greater su- 
Deriority of the TIB. It is possible 
that this superiority may be reduced 
Ог eliminated by a different scoring 
method of the CST now being tested, 
One which is based on a profile of fac- 
tors and categorizing of Ss into pro- 
ile groupings rather than on апу 
Single factor or summation of factors. 
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COGNITIVE TRANSFER IN VERBAL LEARNING! 


JAMES H. 
Colgate 


In a 2-stage transfer paradigm, a 
was first familiarized with a map 
terial embedded in an organized p 
8 simple sentences giving factual 
studied map. Control condition 


Tolman (1948 


) has attributed much 
of the learnin 


g of animals and men to 
their reliance upon a “cognitive map,” 
into which new learning is meaning- 
fully. assimilated, His figurative ex- 
pression refers to an assumed mental 
Organization or Structure, built up 
from past experience, which makes the 
learner’s Surroundings meaningful, It 
is hypothesized that if new situations, 
requiring learning, can be incorporated 
into а Meaningful structure, learning 
is relatively easy ; if not, learning is by 
rote, and more difficult, This basic hy- 
pothesis has been elaborated by other 
theorists (e.g., Ausubel, 1963; Bruner, 
1957), and forms a theoretical basis 
for such school-learning practices as 
understanding versus drill, the use of 
visual aids to introduce new topics, 
and other methods designed to broaden 
the student’s conceptual overview of 
academic subjects. 

Experimental attempts to demon- 
strate the effect of cognitive structure 
upon human learning ordinarily em- 
ploy a two-stage transfer paradigm. 

*This research w 


search grant from th 
Education, Contract 


as supported by a re- 
e United States Office of 
No. OE-6-10-43. 


REYNOLDS 
University 


cognitive-le. 


arning group (N = 10) 
drawing whi 


ch presented verbal ma- 
erceptual structure, and then learned 
material related to the previously- 
Eroups (N — 10 each) learned the 


In the first stage the learner 18 {ге 
miliarized with materials which M 
vide opportunity for the formation se 
a meaningful organization or Led 
ture, and in the second stage oe hag 
terial which bears some relationship © 
that of the first stage is presented y 
learning. Observation of posi ing 
transfer in the second-stage ve 
task is taken as evidence that а CO 
nitive structure was established 
the first-stage treatment. 4 
Ausubel (1963, Ch. 5) has revie i 
the extensive rescarch literature s 
onstrating by this method the wes 
of cognitive structure upon on 
learning. Most of this research, or 
ever, has employed strictly verba sae 
terials in the first stage (e.g. er NT 
rule-learning),, and tested for ма а 
to such broad and complex io demie 
the learning of conceptual 3 iffi 
Subject matter or the solution 0 ffects 
cult problems and puzzles. The ^ cog- 
of perceptual organization, T 0 
nitive effects upon the mago 
simple verbal tasks, have vostiga" 
little study. Also, in many mv om- 
tions the cognitive condition ondition* 
pared only with control con 
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which in the first stage received no 
treatment relevant in any way to the 
Second-stage learning task. Since cog- 
nitive material is usually complex in 
Structure and consists of a variety of 
Interrelated stimulus components, it 
18 possible that the positive transfer 
Obtained may be attributable not to 
the organization or meaningfulness of 
the first-stage task per se, but rather 
to certain discrete elements of that 
task. Were this the case, findings of 
Positive transfer might be explained 
better by assuming a discrete stimu- 
lus-response (S-R) association proc- 
ess, rather than inferring a cognitive 
mechanism. Unless appropriate con- 
trol conditions are employed, it is dif- 
ficult to make a clear decision between 
these alternative explanations of the 
Positive transfer obtained. 
he present study was directed at 
extending and clarifying three aspects 
of the cognitive research available to 
ate. First, the cognitive material pre- 
Sented in the first stage of the transfer 
Paradigm was an integrated percep- 
tual-verbal structure rather than the 
Organized verbal structure often em- 
Ployed. The success of visual aids in 
facilitating learning, and the impor- 
ance attributed to visual stimuli by 
estalt and other cognitive theories, 
Suggest that a visual organization 
Should have strong positive transfer 
effects to the learning of related verbal 
Material. Taking Tolman quite liter- 
ally, the cognitive material employed 
was a map-like drawing on which ver- 
bal stimuli were printed. Second, the 
®arning task following the cognitive 
Presentation consisted of learning spe- 
a € facts related to the previously 
со Үп map, rather than learning broad 
Опсер{ в, This type of task was em- 
ч Oyed for two reasons: (а) Much of 
ip ademic learning can be character- 
zed as the assimilation of new factual 
Material into an existing cognitive 


structure (Ausubel, 1963), and (b) the 
simplicity of the task made it possible 
to analyze transfer effects in terms of 
S-R, as well as cognitive, relation- 
ships. 

Finally, an attempt was made to de- 
termine the locus of the expected 
transfer by employing five control 
conditions. The map used in the cog- 
nitive treatment integrated several 
discrete stimulus components—for ex- 
ample, the drawings of certain objects 
in a specific relationship to each other, 
various verbal stimuli, and positional 
relationships between the verbal stim- 
uli and the pictured objects. It was 
considered that only certain of these 
several stimulus aspects, and not the 
total structure itself, may be responsi- 
ble for positive transfer. Also consid- 
ered was the possibility that the map 
structure might serve simply to es- 
tablish implicit verbal S-R associa- 
tions which could mediate later learn- 
ing and result in positive transfer. To 
evaluate the possible effects of these 
separate components upon second- 
stage learning, various control groups 
received in the first stage either the 
map and the verbal stimuli separately 
(i.e., not integrated into a meaningful 
structure), the verbal stimuli alone in 
varying configurations, verbal stimuli 
which provided only discrete S-R. me- 
diating associations, and the usual 
transfer control of no treatment prior 
to second-stage learning. 


METHOD 


Subjects 

The subjects (Ss) were 60 college men at 
Colgate University who participated to ful- 
fill a requirement for an introductory psy- 
chology course. All Ss were assigned to 
experimental conditions according to a pre- 
determined random order as they appeared 
individually for the experiment. 


Design and Procedure 


Stage II task. A total of six groups were 
employed (N = 10 per group), Ss in each 
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ing exposed to two experimental 
pic Ets II task was to learn the 
following eight simple sentences, each be- 
ginning with the three-letter initials of a 
hypothetical person and ending with a state- 
ment of that person's occupation: KOT isa 
pilot, NEB is a gas station attendant, BAF isa 
shopkeeper, PUM is a truckdriver, DOS is a 
policeman, Raz is a cook, Fer is a brakeman, 
and TUK is a farmer. The initials used were 
CVC's of 85-100% association value taken 
from Glaze's (1928) list. These sentences 
constituted factual material about eight hy- 
pothetical persons, and were also considered 
as eight paired associates, each with the ini- 
tials serving as the S-term and a word of a 
certain class (occupations) Serving as the 
R-term. 

Following various Stage I treatments (de- 
scribed below), Ss in all groups were seated 
before a Stowe memory drum set ina 3 X 
4-foot black screen and instructed to learn 


the eight sentences as quickly as possible, 
Seven learning trials we 
tered. On each 


Оп every tria]. 

5—Cognitive Structure 
of the Stage T treat- 
the Cognitive Struc- 


ture Group was to impose experimentally a 


meaningful and integrated Cognitive repre- 
sentation of stimulus components related to 
the Stage II learning task. The integrative 
representation used was an 8% x ll-inch 
map sketch depicting a highway intersec- 
tion and its surroundings, including an air- 
strip, a gas station, a shopping center, a 
diner, a farm, and a railroad. A tractor- 
trailer and a police patrol car were drawn 
on the highways, Beside each of these eight 
major components of the map structure was 
typed a set of three-letter initials, the stim- 
uli for the Stage IT task. Figure 1 illustrates 
the map and the initials embedded in their 
appropriate positions, 
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The S was instructed to study the initials 
on this map for 30 seconds and meone 
them so that he could reproduce thet 
their proper positions on a later test. / had 
30 seconds the map was removed and S ha 
1 minute to recall and write on a mae ae 
the missing initials in their correct p rond 
Following the first test two more 30-sec ae 
study periods were given, with a TI a 
test between each, providing adequate 5 "n 
for familiarization with the initials erem 
correct placement. At the end of this * ү 
I treatment, it was presumed GB fna to 
Would be familiar with all of the initials to 
which he would later (Stage II) have А 
sociate occupations, and (b) that the in " 
would be embedded in a representative a 
meaningful cognitive structure (ie., the 1 
tegrated map) which would aid in Stage 
learning. m 

Map-List Group. The Cognitive fro 
received in Stage I familiarization of "ifie 
verbal and perceptual stimuli in a Spe ese 
relationship. The remaining groups were 5 
sentially control conditions, providing аа 
Stage I varying degrees of familiarize" ae 
with either verbal stimuli alone, Bode 
grated verbal and perceptual stimuli, xd the 
ther type of stimuli, but never all =a = 
components of the total map stimulus 1 
integrated relationship. isted 

Stage I for the Map-List Group consista 
of presenting to S for study (a) к. dded 
shown in Figure 1 without any em AUS of 
initials, and (b) а list of the eight Se 


* i of 
initials on a separate 3 X 8-inch pes 
Paper, typed one beneath the other Y». 


double Spacing between each. Instru a 
Were to study the map and the ect 
three 30-second periods; with а bu was 
test after each study period, in which Jd гё“ 
to write down all the initials he cou 8- 
call, in the correct list order, on & eight 
inch test strip which contained only onses: 
lines on which to write the correct roaring 
The separate map remained in view at 20 
both the study and test periods, but part 
time were any initials placed on апу Lj 
of the map. Thus, S was familiarize’ | 
the initials and with the map, but thes? 
components were never meaningfully Б 
10 a single structure. r th 

List Group. The Stage I treatment EE ist 
List Group was identical to the map wes 

Troup, except that the separate d the test 
Omitted from both the study an vided 127 
periods. Thus this treatment prol sti 
miliarization with the discrete ver mbedded 
uli, but these stimuli were neither € 


two 
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Fic. 1. Map structure presented to the Cognitive Group during the Stage I study trials. 


ma perceptual structure nor was & poten- 
tially useful structure presented. 
Positioned-List_ Group. Asch, Hay, and 
Diamond (1960) have shown that serial 
learning may be affected by the physical 
configuration or positioning of the items be- 
ing learned. Therefore it was considered 
Possible that the specific positioning per se 
of the initials on the map (Figure 1), for 
example, кот in the upper left corner, DOS 
in the lower right corner, ete., might iso- 
late the stimuli in a manner which would 
facilitate learning in Stage II regardless of 
Whether the map was presented or not. As 
& control to evaluate this possibility, the 
‘osition-List Group received for study in 
Stage I an 8% X 11-inch sheet of paper on 
Which the eight initials were printed in the 
Same relative positions as shown on the map 
in Figure 1, but with the map omitted. This 
Broup also received three 30-second study 
Periods, interspersed with three test periods 
in which S wrote all the initials he could 
recall in their appropriate positions on an 
8% X 11-inch test sheet containing only 
lines on which to write the correct responses. 
his Position-List Group was similar, then, 
to the List Group, in that it received fa- 


miliarization of the initials in the absence 
of the map structure; but differed from the 
List Group in that it learned the initials in 
a specific configuration identical to the map 
positions, rather than in a vertical list. 

S-R Group. One alternative interpreta- 
tion of the function of the map in Figure 1 
is that it provides not a “cognitive struc- 
ture” but (a) familiarization with the verbal 
stimuli, which may transfer positively to the 
Stage II task (Goss & Nodine, 1965, Ch. 6), 
and (b) an opportunity to learn a series of 
discrete verbal S-R associations, which may 
mediate Stage II learning and thus produce 
positive transfer. An example of the latter 
interpretation is that study of the upper 
left corner of the map may yield the im- 
plicit verbal association “airstrip-Kor” which, 
at Stage II, will mediate the association 
“xor-pilot” because of the extraexperimen- 
tal association existing between “airstrip” 
and “pilot.” In transfer paradigm form 
this is an A-B, B-A’ relationship; and Hous- 
ton (1964) has reviewed evidence suggestin, 

А " z g 
that this paradigm should yield positive 
transfer. 

To evaluate the adequacy of this alterna- 
tive explanation of the map function, an 
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: i lank 

A roup received in Stage I a b 3 
e bd Linch study sheet on which xs 
typed, in the same relative positions as n 

the map in Figure 1, the following eig 
з ти EID a 

с: 1 center-BAF, z. 

a oy Rer. diner-naz, train-FER, 
{агт-ток. The Ss in this group also re- 
ceived three 30-second study periods, inter- 
spersed with l-minute tests during which 
they wrote the initials in their correct posi- 
tions on a test sheet which contained the 
eight nouns alone, typed in their proper 
places on the sheet. Thus the S-R Group 
received both familiarization with the ini- 
tials and an opportunity to form discrete 
verbal associations of the type presumably 
available to the Cognitive Group, but no 
potentially integrative relationship among 
the items was provided, 
Rest Group. A final control group learned 
the list of sentences in Stage II, but re- 
ceived no Stage I treatment at all. This 
Rest Group was used to evaluate the effects 


of all other Stage I treatments upon Stage 
II learning, 


RESULTS 


Stage I learning. Despite the ease 
of the Stage I tasks, one to three Ss 
1 every group failed to learn all of 
the initials in the three study periods 
administered. The mean number of 
correct initials given on the last test 
in Stage I ranged from 7.00 to 7.60 
for the five groups receiving the Stage 
I treatment. A preliminary analysis 
of variance verified that the differ- 
ences among these means were not sig- 
nificant, F (4,45) < 1.00, p > 10; in- 
dieating that while Stage I learning 
was not perfect, the conditions receiv- 
ing the various Stage I treatments did 


James H. REYNOLDS 


not differ in degree of cur EE 
with the initials at the time Stage 
began. 

tige II learning. The means алй 
SD's for the total number of o 8 
responses made by each group ni 
the seven trials of Stage II learni i 
are presented in Table 1. The а 
difference among the means, tested a 
analysis of variance, was highly S!£ 
nificant, F (5,54) 3.77, p A tle 
Comparisons of individual con m. 
group means, using the Newm: af 
Keuls method, indicated no significa s 
differences among the five Lair 
Which did not receive the m 
Group treatment. Neither familiari a 
tion of the initials alone, familiari? 
tion of the initials in the presence 
the separate map structure, nor 5 
provision for verbal mediating аар ” 
ations, resulted in positive tran s 
When compared to the Rest T all. 
ceiving no Stage I treatment а he 
Turning to the Cognitive arci 
however, further Newman-Kuels Р “К 
parisons showed that the epe 
between the Cognitive Group and ж А 
of the control conditions was Sig pil- 
cant beyond the .05 level of proba» 8 
ity. Thus positive transfer pee 
tained, but only when the Hed 
components of the Stage I task сер- 
integrated into an organized pet 
tual-verbal structure, А rves 

Figure 2 presents the learning E п 
for each group over the seven e one 
learning trials, illustrating the itive 
sistent superiority of the Cog? 
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Сод. бр. 
S-R 6p. 
Map-List Gp. 
List Gp. 
Position-List Gp. 
Rest-Control 6p. 


Group beginning with the first trial. 
Individual ё tests indicated that the 
Trial 1 mean for the Cognitive Group 
was significantly higher than the sec- 
ond highest S-R Group mean, £(18) 
= 2.59, p « .02. The S-R Group, al- 
though it showed an initial trend to- 
ward higher performance than the 
other controls, did not differ signifi- 
cantly from the Rest Group on the 
first trial, £(18) = 1.32, p > .10. Dif- 
ferenees among the remaining four 
control groups were also nonsignifi- 
cant. Following the first trial the 
Broups tend to converge, the Cogni- 
tive Group approaching the test ceil- 
ing by Trial 3 and the others ap- 
Droaching ceiling at a later point. 
Although learning was rapid in all 
groups, none of the conditions matched 
or exceeded the Cognitive Group after 
Seven trials of learning. 


TRIALS 


Fic. 2. Stage II learning curves for all groups. 


Discussion 


The data clearly indicate that prior 
exposure to an integrated and poten- 
tially meaningful perceptual structure 
may provide positive transfer to a re- 
lated task in which simple factual ma- 
terials are to be learned. The evidence 
is consistent with past findings dem- 
onstrating positive transfer from 
meaningful verbal structures to more 
complex tasks such as problem solv- 
ing and the learning of conceptual ma- 
terials, and confirms their results with 
another type of structure and a sim- 
ple but common learning situation. 

The S-R Group did not differ sig- 
nificantly from the other controls, de- 
spite a trend toward superior per- 
formance on the initial trials of the 
Stage II task. Thus the A-B, B-A’ 
paradigm, when used with discrete 
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iative materials, was 

p s percitus in producing 
on transfer. In contrast the Cog- 
itin Group, which received the 
eee ke A-B, B-A’ sequence but 
th integrated perceptual stimuli in 
the A-B stage, demonstrated strong 
positive transfer. Whether the supe- 
riority of the latter group was due to 
the perceptual aspect or the integra- 

tive aspect of the Stage I stimuli can 

not be determined from this compari- 

son, but regardless of the specific locus 

there is a strong suggestion that the 

integrated perceptual stimuli 


pre- 
sented to the Cognitive Group pro- 
vided more than a series of discrete 


verbal mediators. 

The Cognitive Group was also con- 
sistently superior to the three control 
groups which received various compo- 
nents of the total map structure dur- 
ing Stage I; but none of these three 
controls demonstrated positive trans- 
fer relative to the Rest Group, which 
received no Stage I treatment at all. 
This lack of positive transfer in the 
component control groups suggests 
that the transfer observed for the Cog- 
nitive Group was not attributable to 
the specific map components pre- 
sented, either alone or in combination, 
but rather to the organization of the 
various components into a single struc- 
ture. It is this integration or interde- 
pendence of the discrete components 
of an experience which is ordinarily 
implied by the term “meaning” as it 
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James Н. REYNOLDS 


is used in describing cognitive e 
ture. The data indicate that it e af 
organizational or “meaning € ^ 
the Stage I task, rather than t ae 
miliarization with either the in э 
alone or the initials and the ps ves 
arately, which contributed to p p "s 
tive transfer demonstrated T this 
Cognitive Group. The fact. tha Бе 
transfer was obtained with simp s by 
terials which were easily learne jude 
rote makes it reasonable to oe is 
that, for human learners at leas БИШ 
organizational aspect is a pow есч 
factor in even basic learning proce 
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